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INTRODUCTION 


A prototype balance study was conducted on earth [SMEAT] prior 

, f* 

to the balance studies conducted in Skylab itself. The data collected 
were the daily dietary intake of 6 minerals and nitrogen, and the fecal 
and urinary outputs on each of three astronauts. Our goals have been to 
establish methods of analysis that would allow computing the net balances 
along with the error bounds of these estimates; to search for structure 
in the data which might allow a tightening of these error bounds and 
to possibly improve the design of future balance studies; to evaluate 
the biological implications of the SMEAT data themselves and relative 
to data which will be obtained in Skylab or other metabolic balance 
studies . 

SUMMARY OF CONCLUSIONS 

We have set out the essential statistical issues, showing what 
quantities need to be estimated and establishing the scope of the 
inference associated with alternative variance estimates. The procedures 
for obtaining the final variability due both to errors of measurement 
and "total error" (total = measurement and biological variability) are 
exhibited. The previous document by Arnold [1] dealt only with the former, 
very restricted, issue. 

Quantitative estimates of both the measurement and total variance 
in the net average apparent retention as a function of the duration of 
the study have been obtained. These have implications in the design of 
future balance studies. The mean net retention and its variance establish 
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the reproducibility of the balance study for each of the substances considered 
leading to consistent results across subjects. We establish that no inter- 
dependencies or periodicities allow significant reductions of the above 

. ■«' 

variance estimates. 

The "biological significance" of the net retentions which have 
been observed is considered. We conclude, unfortunately , that the inter- 
pretation of balance studies of this kind present unresolvable difficulties 
and it is uncertain whether this study [SMEAT] can serve as a base line 
for the interpretation of the data from Sky lab. 

SECTION I 

The Data Base . We have concentrated on the SMEAT data available 
for nitrogen (N) , calcium (Ca) and potassium (K) . The principle variables 
in the analysis are, for each of 100 days, the total content of the daily 
diet and the urine and feces. Two features of the data — [3 astronauts x 
3 minerals x 1 input + 2 excretory modes] = 27 data series — that conditioned 
our investigation are: the diets had an approximate (but not equal) 6 day 

periodicity with only small variations among the 6 days and the fecal 
series had highly fluctuating values with zero output on many days. [The 
series also contained a few irregularities like missing values, duplicate 
readings, etc. Individual judgments were made regarding the most appropriate 
single valyie to replace each of these irregularities; using the sample 
information efficiently. There were not many such problems.] 

Definitions and Statistical Properties of the Fundamental Balance 
Quantities . If 1^, U^, and represent the mineral content of a particular 
mineral (in g, say) in the i th day's diet, urine and feces, respectively, 


then we may define the net retention on day i as 


(1) NET ± “ Ij. * U ± - F ± 

and the average cumulative retention through day k as 

, f 

k 

E NETi 

(2) CUM k = 


Regarding 1^, and F^ as random variables possessing a mean 
(E) and a variance (G 2 ), the same for all i, we find by the rules of 
elementary statistics that 


(3) 


E(CUM k ) 


k 

£ E (NETf > 

i=l 

k 


k * E (NET ) 
k 


= E (NET) 


and similarly 

(4) . a 2 (CO !M k ) = !i-2hssl . 2i£S) 


Also notice that the right-hand sides are given by 

(5) E (NET) = E(I) - E (U) - E (F) 

and . 

(6) a 2 (net) = a 2 (i) + a 2 (u) + a 2 (F). 

We suppress the subscript i wherever possible above by the 
assumption that E and a 2 of NET^ are the same for all i, equal 
respectively to E (NET) and a 2 (NET) . 

In a balance study of duration k days, the principal quantities 
whose estimates we desire are 

E(CUM k ) and a 2 (CUM k ) 

♦ 

We may obtain confidence intervals for estimates of the average 
retention E(NET) as a function of MBE duration as, in the conventional way. 
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(7) E(CUM k ) ± 2 / a 2 (CUM k ) 

where the "hats" denote data-based estimates of the corresponding true 
parameters . 

Equations (3) through (7) demonstrate, then, that to establish 
error bounds on the principal quantity of interest, E (NET) , is effectively 
a question of finding estimates of 

a 2 (i) , a 2 (u) , a 2 (F) , 

the variances of the mineral contents in one day's diet — urine and feces. 

The SMEAT experience consisted of "pre-, in- and post-chamber 
phases" and later discussion in fact demonstrates that it is not always 
appropriate to assume that the above quantities cr 2 are independent of 
the day-number i. But in the present treatment we find this assumption 
tenable so long as i is not too near 1 or 100, i.e., so long as we 
eliminate the highly variable beginning and end of the experiment. 

The issue of the independence of E on i has also to do 
with the important issue of trend in net retention over time which we 
will return to later. 

Scope of Desired Inference Determines Appropriate Estimates of 
Variance — Status of the Arnold Document [1] ♦ We may regard the data 
series for I, U, and F in essentially two distinct ways; the appropriate 
estimates 

a 2 (i), a 2 (u) , a 2 (F) 

will be correspondingly different and the scopes of the corresponding 
inferences will be also. 


The Restricted Inference , This corresponds to answering a 
question that may be put this way: "What is the estimate of this 

astronaut's net retention in this particular metabolic balance 

>4 

experiment (MBE) ?" If all quantities were measured with perfect 
precision (and if no "edge effects" occurred because of where the 
starting and the stopping of the MBE "caught" the I, U, and F series), 
then this question could be answered with perfect precision, i.e., the 
confidence interval (7) would consist of the single point 


e(cum d ) = 


D 

l NET. 
i=l 1 

D 


where 

D - number of days in MBE 

Status of the Arnold Document [1] . Absent perfect precision 
(but still ignoring edge effects), the width of (7) is determined by 
estimates of the errors of measurement in 1^, U^, and F^. 

This is the way Arnold [1] regarded the NASA MBE's; he set forth 
a model that was intended to allow obtaining estimates of 

0 2 (i), a 2 (u) , a 2 (F) 


where a 2 now denotes measurement error only. He then asserted that this 
is the only source of variability that need be taken into account. 

The present study regards the Arnold document as only relevant to 
the narrow question stated above and asserts that this is in fact Only a 
’rather specialized and peripheral issue. This section sets forth what we 
take to be the central question at issue. Appendix II, however, does build 
upon the model of the Arnold document, changing it somewhat and using the 
data themselves (the U and F series as well as others described in Appendix II) 


- - 








i 


to compute estimates of the measurement uncertainty, because we do 
regard as important the issue of what proportion of the variability rele- 
vant to our inferences is in fact due to measurement error. Our result 

• i' 

is that this proportion is small, indeed usually negligible in all cases. 

The Appropriate Inference . The question associated with the 
relevant inference can be posed in this way: "If we regard the present 

MBE as a particular realization of an MBE under these experimental 
conditions, what inferences can we make about the uncertainties in net 
retention under these experimental conditions? " In the context of the 
Skylab experiment, the difference is between asking "What happened in 
this experiment to this astronaut while in flight?" and asking "What 
is likely to happen to this astronaut if he were to repeat the experiment?" 
[Since SMEAT is regarded as a prototype of Skylab, it is relevant to ask 
what inferences can be drawn from a comparison of the data obtained in 
SMEAT and that obtained in Skylab? This cannot be answered without data 
from Skylab but this is discussed below.] 

To make inferences with this wider scope, we must contend not 
only with measurement errors but also with the "biological" variability 
that would be present even if all measurements could be made with perfect 
precision. Indeed, we find that to make inferences with this wider scope 
requires paying a high price in the efficiency of our estimates — i.e., our 
confidence interval (7) that makes a statement about the random process from 
which we have observed a single realization proves to be much wider than 
that pertaining only to the particular realization. This follows from the 
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unfortunate fact that biological variability far outweighs measurement 

error. The conceptual distinction reflects itself dramatically in 
the very different estimates of 0 Z (I) , 0 2 (U), and 0 2 ( F) that are to be 
substituted in equations (4) , (6) and (7) . 

Summary of Estimates . Table 1 provides our estimates of the 
mean and the variance of I, U, F, and NET for each subject where 


NET = I - U - F 


The three subjects are identified as S, C, and P (or as SPT, the 
scientist-pilot; CDR, the commander-copilot and PLT, pilot) . One value 

A A A 

for E and two values for 0 are included, the first relates to the 
entire study and the second roughly to the in-'chamber period. The 
in-chamber period was the most orderly phase of SMEAT and the estimated 

A a 

cr are usually, but not always, somewhat smaller than for the entire 
period. The second 0 thus presumably represent the minimal variance 
that can be ejected in such studies. 

Section II describes how the estimates of 0 were computed. 
Appendix I documents our efforts to reduce these large variances by 
searching for patterns in the data. Our principal finding is that, 
unfortunately, substantial reductions in variance are not available and 
the variances presented in Table I are realistic estimates of the 
variability inherent in metabolic balance studies. 

Table 2 summarizes the findings derived in Appendix II relative 
to the fraction of the 0 which is due to errors of measurement. The 
total variance has been decomposed into the variance due to biologic and 


* 

measurement components ; 

° 2 total 



biological 


+ a 


2 

measurement 


Table 1. E, a 2 for I, U, F and ftzrf (g and g 2 , respectively) 






I (input) 

U (urine) 

F (feces) 

NET 

"s« 

Nitrogen 

/v 

E 



16.819 

15.202 

0.9367 

0.680 


a 2 

(diet periods 

1-12) 

2.1962 

36.040 

0.4166 

38.653 


a 2 

(diet periods 

4-12) 

0.3590 

29.521 

0.4620 

30.342 

Calcium 

A 

E 



0.8593 

0.1172 

0.5695 

0.1725 


a 2 

(diet periods 

1-16) 

0.001510 

0.0007039 

0.1951 

0.1973 


a 2 

(diet periods 

4-13) 

0.0005960 

• 0.0006063 

0.1367 

0.1379 

Potassium 

A 

E 



4.0708 

2.8674 

0.3303 

0.8732 


a 2 

(diet periods 

1-16) 

0.5930 

0.5087 

0.08932 

0.6573 


a 2 

(diet periods 

4-13) 

0.1107 

0.4248 

0.08461 

0.5205 


"C" 


A 


Nitrogen 

E 



16.1812 

13.296 

0.9205 

1.9675 


a 2 

(diet periods 

1-16) 

0.2798 

10.481 

1.1496 

11.910 


a 2 

(diet periods 

4-13) 

0.2371 

8.7675 

0.9663 

9.971 

Calcium 

A 

E 



0.8498 

0.2466 

0.4418 

0.1619 


a 2 

(diet periods 

1-16) 

0.0001327 

0.001744 

0.2726 

0.2745 


a 2 

(diet periods 

4-13) 

0.00002188 

0.001239 

0.2779 

0.2791 

Potassium 

A 

E 



3.8781 

3.05947 

0.2581 

0.5605 


a 2 

(diet periods 

1-16) 

0.007779 

0.30776 

0.9127 

0.4068 


a 2 

(diet periods 

4-13) 

0.001291 

0.21774 

0.0933 

0.3123 


*lp«» 


A 


Nitrogen 

E 



16.798 - 

13.8117 

1.1727 

1.8136 


a 2 

(diet periods 

1-12) 

0.5450 

13.9891 

0.4826 

15.0167 


a 2 

(diet periods 

4-12) 

0.5450 

13.1447 

0.4284 

14.1182 

Calcium 

A 

E 



0.8484 

0.1223 

0.5765 

0.1496 


a 2 

A O 

(diet periods 

1-16) 

0.0001494 

0.0004431 

0.1492 

0.1498 


a 2 

(diet periods 

4-13) 

0.00002137 

0.0003564 

0.1207 

0.1211 

Potassium 

A. 

E 



3.9701 

3.016 

0.3241 

0.6308 


a 2 

(diet periods 

1-16) 

0.009418 

0.1802 

0.7413 

0.9309 


a 2 

(diet periods 

4-13) 

0.003722 

0.2050 

0.04208 

0.2508 


and in Table 3 


- 9 - 


100 a 


measurement 


total 


and is shown for U, F, and NET of Ca and N for astronaut C. 

It is apparent that measurement error contributes only 
minimally to the uncertainty associated with the mean values observed 

^ A 

in the study. [N.B. As section II will make clear, our estimates of a 

for I in Table 1 do not include any variance contribution from the purely 

■cyclical part of the diet variability as, indeed, one would not wish 

them to. Also, notice that any deviation from the pure cyclicality must, 

in fact, be due to measurement error and/or lapses in the experimental 

procedure. Absent the latter, then, the entries for I in Table 3 must 

be 100% and any deviation therefrom must be due either to accidents 

peculiar to this experiment or to artifacts of the calculation procedure — 

neither of which would be important to document. Indeed, these causes 

reflect themselves in (J . (I) for Ca for astronaut C in Table 2 

measurement 

A 2 i • 

being greater than both a (I) s in Table 1. This is a nonsensical result 
if taken literally but Table 2 was calculated from day 2 of C’s diet.] 

The fact that the percentages in Table 3 are so small demonstrates 
that little can be gained by improvements in analytical accuracy. We have 
accordingly devoted considerable effort to exploring whether the biologic 
variation can be reduced by exploiting interdependencies among the data 
(Appendix I). Unfortunately, no means of reducing the variance has been 
* found and the values presented must be considered to be .representative of 
the information gained from this kind of metabolic balance study. 
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Table 2. 

Measurement-Variance 
day's output) 

Estimates, g 2 

(CDR; day 2 diet; 

"typical" 

, <** 

Input 

Urine 

Feces 

NET 

Nitrogen 

0,046178 

0.1371 

0.01162609 

0.1950 

Calcium 

0.00016909 

0.00008968 

0.00022533 

0.0004850 


Table 3. 

Percent of Total-Variance 
Measurement Errors (CDR) 

Estimate 

(for periods 4-13) Due to 


Urine 

Feces 

NET 

Nitrogen 

1.6 

1.2 

2.0 

Calcium 

0.7 

0.08 

0.17 
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Biologic Implication, of the SMEAT Data . It would seem certain 

that in well-nourished young men who consume a "normal" diet and who do 
not change weight to a significant degree that the body composition must 
be approximately constant over rather long periods of time. If this is 
true, the amount of various nutrients lost from the body must be approxi- 
mately the same as the amounts consumed. This is the fundamental 
assumption upon which balance studies such as SMEAT and Sky lab are 
based. 

Consider now the mean retentions for the various nutrients 
shown in Table 1. For astronaut "S" the average retention of nitrogen is 
0.68 g/day. Thus, over an 100 day period one would calculate that a total 
of 68 g of nitrogen would be retained if this value is correct. This would 
be equivalent to 425 g of protein (protein = N * 6.25) or 2360 g of new 
tissue if one accepts the conventional figure of 18% of protein in average 
tissue. The latter assumption would also lead to an estimate of approximately 
12.6 kg of protein in an average man weighing 70 kg. Thus, the net retention 
of nitrogen would lead to an estimate that the amount of nitrogen retained 
during an 100 day study would increase the total body protein approximately 3.4%, 


■ 3.37, 

A similar calculation for astronaut "C," showing a mean retention of 1.96 g N/day 
leads to an estimated increase in the body protein of 1.225 kg in 100 days 
or approximately an 9.7% increase in the total body protein. Since in the 
nitrogen retention data there is no apparent tendency for the retention to 
decrease with time (Section II, Figs. CUM1 - CUM3) , these values appear to be 
representative of these subjects under these conditions. Thus, had the study 


NITROGEN BALANCE, g , do. 



Fig. 1 

In the studies of Young et al. [2] and Calloway and Margen [3] , egg protein 
was fed to young adult men at low levels. In the two papers of Clark et al. 

t 

[4,5] young men were fed a mixed diet which supplied from 5 to 16 g of 
nitrogen/person/day. The data of King et al. [6] were obtained with 10 young 


Ji 
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pregnant women who received up to 18 g N/day . These women were 4 to 7 years 
postmenarche and were considered to be mature. Thus, the nitrogen retention 
would be expected to be that required for the formation of the fetus and 
the accessory tissues related to pregnancy. Abernathy et al. [7] studied 
young girls weighing from 25-30 kg. Standard tables indicate that such 
girls should be gaining about 6-7 g/day. The results were reported as mg N 
retained/kg/day and these values have been multiplied by 65 to make the 
results more or less comparable to the other data obtained with adults. 

The values plotted would represent the retentions expected in adults growing 
at the same rate as the children. No other corrections for body size were 
made in the data plotted although there were substantial differences in the 
weight of the subjects studied. Finally, the single average value obtained 
by Grindley [8] is shown. This was obtained with 23 young men who were 
continuously investigated over a 220 day period. They received a more or 
less normal diet which supplied about 13 g N/day.* 

The regression line in the upper right hand portion of the 
figure was drawn by inspection. Since each point represents variable 
numbers of subjects, the true regression line would be obtained by 
appropriate weighting for the number of subjects. It is not likely, 
however, that this would deviate greatly from the line shown. 

It is quite remarkable that in this heterogeneous group of 
experiments in which the subjects differed in age, sex, size and 
physiological state, and in which the experimental conditions varied as 
well as the nature of the protein fed, nitrogen intake is the only 
Variable which appears to explain the nitrogen balance. Indeed, 
the variability around this line when the intakes exceed 


5 g N/day does not appear to be much greater than that observed in the 

individual experiments. 'There is no indication that a "plateau" is reached 
as the intake is raised. 

The average retehtions observed for each of the subjects in 
SMEAT as well as the grand mean for the three subjects are also shown in 
Fig. 1. It is apparent that although most of the balance studies reported 
in the literature are for relatively short periods, usually two weeks or 
so on the diet and measurements being made during the latter part of 
the study, there is little indication that the data from SMEAT vary 
appreciably ~ from the general data obtained. Since neither SMEAT nor the 
long-term balance study of Grindley depart significantly from the general 
findings, "adaptation" to the diet does not appear to be a significant 
factor in the results obtained. This also confirms the conclusion 
(Appendix I) that in nitrogen balance studies little is gained by extension 
of the study period. 

The values above the balance line indicate a "requirement" 

(where the balance is zero and excretion approximates intake) of about 
0.5 g N/person/day . This would be equivalent to about 31 g of protein/day 
for a 65 kg adult. Similar estimates have been reached by many investi- 
gators in the past. 

The slope of the regression line at the higher intakes indicates 
a retention of approximately 20% of the intake above 5 g/day. That is, 

there is a retention of approximately 2 g N/day at an intake of 14 g/day 

2 . 

.and x 100 = ± 20%. A retention of 2 g of nitrogen is equivalent 

* 14-5 

to 12.5 g of protein or about 70 g of tissue if one accepts the conventional 
figure of 18% protein in body tissue. This would result in a gain of 
over 25 kg/year and clearly cannot be typical of young men who do, in 

t 


- 15 - 


fact, often consume diets containing 14 g of nitrogen or more. 

Since nitrogen and other nutrients are lost through the skin 
as sweat, desquamation of skin, hair, nails, etc., and these are not 
measured in the usual balance study, the positive balances are falsely 
high. Attempts to measure such losses, however, indicate that under 
ordinary conditions they are of the order of 0.3 to 0.5 g N/day [9,10]. 

If one accepts -the higher value and subtracts 0.5 from each value, the 
line is shifted to the right but the conclusions are not changed to any 
substantial degree. 

Forbes [11] has suggested that the balances are erroneous 
because modification of body composition after a dietary change may occur 
slowly and exponentially with time. The usual short balance periods of 
two weeks or so would be too short to identify stable conditions . There 
is probably sufficient data to show that this suggestion is not valid when 

the protein intake is lowered and that relatively stable conditions are 
found within a week or 10 days. Adequate data after the dietary protein 
is: raised have not been found. However, the data reported by Grindley 
and the present findings of SMEAT lend no support to the argument presented 
by Forbes, at least insofar as nitrogen balance data are concerned. 

Wallace [12] has presented the reasoned argument that there is 

a Consistent "bias" in balance studies which inevitably lead to falsely 

high apparent retentions. It can be assumed that in most balance trials 

the intake will be overestimated since the subject may not eat all of 

the food offered but has little or no opportunity to consume more than 
♦ 

offered. Excretion will generally be underestimated since some loss is 
probably inevitable but it is difficult to collect more excreta than are 
actually produced. In the context of SMEAT when the men were actually 
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in the chamber, we see little possibility that the high net retentions 
could be explained on this basis. The average net retention of the 3 
subjects was 1.48 g N/day, approximately 13% of the nitrogen intake above 

5 g/day. Retentions this high would require overestimate of the intake 

*i » 

by about 6 to 7 % and similar underestimates of the excretion or larger 
errors in one or the other. This seems most improbable. 

The average net retention of potassium was 0.688 g/man/day or 
some 68 g/100 days. Considering the fact that the estimated total body 
potassium is of the order of 120 g, it would follow that if this were a 
measure of true change in body potassium, the body potassium would double 
over a 200 day period. It can be assumed that relatively large amounts 
of potassium are lost through the skin and the results of potassium balance 
studies are not meaningful. 

The calcium balance data are more difficult to interpret. 

Although they show a consistent and improbable retention (average 
retentions = 161 mg/man/day or a total net retention over 100 days of 
16 g of calcium) , the amount retained would be relatively small compared 

I 

to the total body calcium content. If it be assumed that the total body 
contains something of the order of 1.5% calcium, the total body content 
would be of the order of 1.05 kg. A net retention of 16 g during a 100 
day period would amount to only 1.5% of the total body calcium, apparently 
within the range of possible retention without observable physiological 
consequences. Of course, if this rate of retentions is extended to long 
periods, it leads to ridiculous conclusions. The cumulative retention 
curves (Section II, Figs. CUM4 - CUM6 ) , however, indicate a gradual reduction 
in the degree of calcium retention suggesting, possibly, an adapatation to 
the diet fed. 
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Duncan [13] reviewed the results of calcium and phosphorus 
balance trials in ruminants. Carcass composition is better known in animals 
than in man and in some of the trials animals were killed for carcass 

I 

analysis. Her conclusions, in part, were that "cumulative estimates of 
retention add up to totals far in excess of likely increments" and that 
"Careful examination of published metabolism trials has revealed no 
probable explanation. The discrepancy cannot arise from losses through 
the skin and is not likely due to failure to collect all of the excreta." 
These results are important relative to the nitrogen balance studies since 
there has been speculation that the false positive nitrogen retentions 
might be explained by reduction and loss of nitrogen as gaseous N 2 or ammonia 
Volatile losses clearly cannot explain false retentions of calcium. 

If SMEAT is considered to be a typical metabolic balance study 
or a prototype of Skylab, the' results of Skylab will presumably be compared 
with the data obtained in SMEAT. The confidence intervals of the NET of 
the various nutrients are, therefore, important. Table 4 from Appendix I 
presents the most optimistic estimates of the confidence intervals for 
each nutrient for each subject- 


Table 4. 

Long-Term Confidence 
(in g) 

Intervals for 

Net Retention 

Nutrient 


Subjects 



S 

C 

P 

Nitrogen 

0.60-1.96 

1.30-2.60 

1.00-2.70 

Calcium 

0.10-0.25 

0.06-0.27 

0.08-0.22 

Potassium 

0.73-1.02 

0.45-0.67 

0.53-0.73 
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Considering the large variance attached to NET of each subject 

and the rather large differences in mean NET and variance between subjects, 
the difficulties or limitations in comparing one study with another with 
different subjects and under different environmental conditions becomes 
obvious . 

From the net retentions of nitrogen and potassium, it is clear 
that the findings are unrealistic and it may be assumed that the calcium 
balances are also unrealistically high. Although it cannot, be proven, the 
only possible explanation that seems reasonable is that other losses 
(sweat, skin, hair, etc.) which are not measured are higher than the 
conventional estimates. Very large losses of certain nutrients — calcium, 
for example — have been reported in men under high environmental temperatures 
[14] . The losses reported, in fact, were so high that they could not possibly 
be representative of most people living in hot climates. Nevertheless, if 
this is the explanation of the results obtained, both of the average 
retention and the differences observed between experimental subjects, 
it cannot be assumed that meaningful results will be obtained by applying 
a standard correction for such losses as is often assumed. They will 
have to be measured if, as the data suggest, they may be the primary 
cause of differences and unexplained results obtained with balance studies. 
This is probably particularly true in Skylab where the effects of the 
environment on dermal losses are completely unexplored. 

The difficulties associated with accurate measures of skin 
losses are also well known. The dangers of contamination are great; 
collecting skin losses requires large volumes of fluid and imposes 
analytical difficulties; cooperation from subjects is also difficult to 
maintain. It seems unlikely that meaningful experiments will be done 
except under unusual conditions. 
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There is great need, therefore, for independent measures of 

body composition. In the studies of King et al. [6], the average values 

of which are plotted in Fig. 1, the changes in total body 40 K were 

determined. Assuming the conventional relationships between body K and 

body N, the 40 K counts yielded estimates of nitrogen retention ranging 

from 29 to 127% of the values estimated from the nitrogen balance studies 

with an average value of 67%. It appears from the published data that 

there was no correlation between the estimates derived from the balance 

studies and those from the 40 K counts. The mean value of 67% in these 

pregnant women who were obviously retaining nitrogen appears to be more realistic 

than the nitrogen balance data. 

We unfortunately must conclude that however accurately metabolic 
balance studies are conducted they will inevitably lead to uncertain results. 

The usual experience is that the more nutrient fed the more "apparent 
retention" is obtained and that the results are not made more interpretable 
by very long-term studies. Since the reasons for the "errors" are unknown 
(even though skin losses seem the most logical explanation) , comparative 
studies with different environmental conditions or different subjects must 
be viewed with great caution and suspicion. Calcium balances are particularly 
difficult to interpret. Adaptation to new conditions may require months and 
the losses or gains in total body calcium which appear, to occur, as in SMEAT, 
may be insignificant in terms of the total body supply. Clearly, one cannot 
project losses observed over even a few months to long periods. 

Studies utilizing activation analysis for estimation of total 
body calcium or nitrogen or the measurement of total body potassium by 40 K 
counts or the estimation of total potassium pools appear to offer the best 
direction for research. Even though the accuracy of such measures is difficult 
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to establish, they should be satisfactory for comparative measurements, 
before and after experimental treatments,, as in the exposure to weightless 
conditions. Until such independent measures of total body composition are 
available, the interpretation of balance studies will remain questionable. 

SECTION II 

Our program in the remainder of the text and Appendix I shall 
be (1) to explain how the estimates of total variances (and means) in 
Table 1 were obtained; (2) to assess their implications regarding MBE 
accuracy and design; (3) to explore where the larger variance-contributions 
are and whether they can be reduced significantly. 

Appendix I essentially shows that these variance estimates 
in Table 1 represent realistic lower limits to the variabilities inherent 
in the MBE procedures for each of the three substances — N, Ca and K. 

Derivation of Table 1 . Does diet "drive 11 excretion? One 
question in our search for structure to the basic data series, I, U and F, 
is: to what extent are the excretory variables driven by the periodic 

diet? Because the diet is, with only moderate departures, repeated 
every 6 days, i.e., there are 6 one-day menus for each of the three 
astronauts that are administered cyclically, this question centers around 
searching for periodicity in U and F with a 6-day cycle . Appendix I 
applies to these series Several statistical techniques well suited to 
♦examining cyclical features of time series; these include autocorrelations 
and autoregressions, and spectral and cross-spectral analyses. 
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We should like our summary estimates of total variability not 

to reflect any variation in I, U and P due to these 6-day periodicities. 

In order to obtain even more "optimistic” variance estimates, we should 

like t6 regard as being "known in advance" any trends across time between 

6-day periods in these series. 

Interpretation of Variance Estimates . Each a in Table 1 

is accordingly the residual mean square of a two-way analysis of variance 

(AV) of each of the I, U and F series viewed as having 6-day periods, i.e., 

from each series {Uj.} say, we define a matrix U whose (l,m) element obeys the 
relation 

(8) u lm = U ei +m 

This amounts to regarding each such u^ m as 

(9) u lm = E < u > + T l + ^ + e lm 

where E(U) is an overall mean,- T ^ is a "period-effect" (which would 
increase with increasing 1, say, if the astronaut tended to lose more 
mineral in the urine as the MBE progressed), d m is *a *diet-day effect" 

(which might reflect the 6-day sequence of the diet "loadings" of the mineral) 
and e^m a ran< * om error, identically distributed for all (l,m) . The 
residual mean square is an estimate of a 2 (e) and is thus an optimistic 
estimate of 0 2 (U) as it is less than the total mean square 

[£ £ (U, - E(U) ) 2 / (length of series!] 

i i 

! 1 m 

by an amount that increases as the rows and columns of U do display 

* 

the effects allowed for in increasingly systematic degrees. Because 
any departure from randomness between 6-day periods (that would give a • 
net higher variance than under-randomness) would reduce the mean sum of 
Squares, the variance entries in Table 1 are optimistic in that they 
reflect a reduction due not necessarily (or only) to trend but rather 
to a wider class of alternatives to the assumption of between-period 
randomness . 
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Contributions t.o Total Variance CT 2 . Table 1 shows that, for 

N and K, the primary source of the variability in NET is the urine, i.e., 

jo z (U) > > 0 2 (F ) ) 

, 

for calcium, by contrast, 

a 2 (F) > > a 2 (U) . 

The latter fact reflects the violently fluctuating character of all the 
fecal series and the fact that more calcium is excreted in the feces’ 
than in the urine; the former reflects the fact that more N and K are 
excreted in the urine than in the feces. Figs. C1-C6 are the graphs 
of I, D and F for N and for Ca respectively for astronaut C. 

Notation Convention . From now on we refer to data series in 
the forms indicated in the titles of the figures as: letter-name-substance 

where letter = astronaut identifying letter (S,C or P) , name = the variable 
name (I, U, F, NET, etc.); substance = substance (N, Ca or K) . Thus 
SNET Ca refers to the net retention of Ca in subject S. 

Implications of Table 1. As discussed previously, we note' 

A 

that all values of E (NET) are positive and these values cannot represent 

reasonable estimates of actual retention. The retentions are clearly 

artificially high since dermal excretion was not measured. Estimates 

available currently for the amount of dermal losses do not appear to be 

large enough to account for these retentions, yet this provides the only 

reasonable explanation of the results obtained. More important, however, 

would be the implication that if all three of the astronauts were, in fact, 
"in balance" then the dermal losses are highly variable in both amount 

and composition. Clearly, if this is so, such losses would have to be 
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measured if the balance data are to be meaningful. The application of 
an "average" correction for dermal excretion, whatever the value chosen, 
leads to clearly unacceptable results for one or more of the astronauts. 

Long-Term Confidence Intervals for NET Retention . The E (NET) 
and C 2 (NET) results in Table 1 allow fashioning an approximate confidence 
interval for NET by 

(*) E (NET) ± 2 

where min(a,b) = smaller of a and b. Choosing' the smaller variance 

will give the more optimistic interval; the factor "2" assures that the 

probability that the interval (*) does not include the "true" NET is small, 

but the exact probability is not available as not enough information is 
available to specify the theoretical distribution of NET. Normal probability 

plots for NET (of which we include only one for SNET N in a later section) 

suggest that NET iss sufficiently gaussian for this approximate factor to 

yield appropriate intervals but not so for attaching precise tail- 

probabilities. 

These confidence intervals (*) are based on the entire MBE, and 
as such may perhaps be regarded as "minimum feasible" uncertainties, 
characterizing their respective MBE's, i.e., intervals that can be achieved 
through very long MBE's. Table 4 (p.15. Introduction) exhibits these 
long-term confidence intervals for each of the 9 NET series of Table 1. 

For each substance, the three long-term confidence intervals in 
Table 4 overlap lending credence to the findings. These overlaps, along 

* >S 

with the fact that all 9 intervals show E(NET) > 0 throughout, are 
consistent with the MBE design bias toward apparent retention. 


/ 


min (CJ ("1-16") , CT* ("4-13") ) 


/length of NET-series in days 
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"Updated" Average Daily Retention . Figs. CUM 1 ~ CUM 9 exhibit 

the 9 graphs of the variable CUM of equation (2) for the three astronauts 
and the three nutrients under study as against the day number of the MBE's. 

Gaps in some nitrogen data* allow us to continue two of the CUM nitrogen series 
only to day 74; the other 7 extend to day 101 (for "C" and "S") or 102 (for "P") 

Figs. CUM 10 - CUM 12 plot the CUM series for all three astronauts 
on the same axis, for each of the three substances: these plots exhibit 

the gradually increasing consistency in the average NET's for each astronaut 
as their MBE's progressed. 

The upturn of SCUMK on days 89-101 is spurious being due to a 
sudden increase in SPTICa during these final days to which SPT evidently 
had not the time to acclimatize. 

The CUM plots have the general appearance of oscillating wildly 
for the first 3 or 4 diet periods (18-24 days) and then assuming a smooth 
character that gradually either increases or decreases, often with a perceptible 
nonlinearity (concavity or convexity) . A qualitative look as to how far along 
into the MBE's the CUM "stabilizes" suggests that comparatively little new 
information is added by, say, the second half of the MBE; the implication 
regarding optimal MBE design will be discussed in a following section on 
conjunction with the issue of distinguishing true , or biologically-based, trend . 
We wish now to discuss the interpretation of the concavity noted above as an 
artificial trend . 

Artificial Trend . The concavity, giving an appearance of trend in 
average NET over time, is at least partly an artifact of the averaging process. 
This effect can be understood as being due to a term in CUM^ equal to mA/k 
where A is the difference between the average of the daily NET'S up to day m 
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and this average after - day m, where m is small compared to k. 

Fig. CUM 13 emphasizes this point. This plot shows a not atypical 
realization of versus k, where 

•*' . k 

C = l Njl 
k i=l 1 

and N^ is a random sample of size 100 from a gaussain distribution with 
mean 0 and variance 1. 

We do not discuss how departures from gaussianity of the NET 

distributions are reflected in the analogous CUM^ versus k graphs; for 

the present purpose we find the empirical distributions of the NET variables 
to be sufficiently gaussian as shown by normal probability plots. Fig. NORMN 

is such a plot for SNET N (the SN MBE was a comparatively variable MBE) . 

Note : Although the plot is close enough to a straight line for the 

present purpose, it has perceptibly heavy tails, a typical finding for the 
other NET plots (which we do not include) and this accounts for our qualifi- 
cations about the exact confidence levels associated with the intervals 
computed below. 

Coefficients of Variation per MBE . If we regard for the moment 
the estimates of NET variance from periods 4-13 in Table 1 as estimates 
available to the MBE's prior to the MBE, i.e., as known characteristic 
numbers for this experimental procedure (and also for this subject in the 
present use), then we may obtain estimates of the accuracy of the nine MBE's 
corresponding to each subject and each nutrient as a function of MBE length 
in days as follows. The coefficient of variation for each MBE on day d is 


d E d (NET) 
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where 0 is the variance estimate as above from Table 1 and 

d - 

E d (NET) = CUM d = i^NET^. 

~a 

(Compare {*) where d = maximum day number). Figs.. CV 1 - CV 9 show CV d versus 
d, i.e./ the uncertainty in the MBE as a function of experiment's duration 
in days, using the "updated mean" E^ and the "prior variance" a 2 . 

(An "updated variance" might involve successive analyses of variance for 
successive periods, but we do not view this as an important ramification 

A 

for the present study.) Because E d (SNET N) 0 as d + 101, the CV 
fluctuates wildly; for this case we display in Fig. CVla the more stable 
statistic CV 1 where 


cv 1 = <1! 
'"'a 


E 101 (SNET N) 

which regards the limiting NET as known in advance of the MBE (as CV regards 

i * 

the variance estimate only ) . For this case CV is proportional to d 
exactly which accounts for its smoothness. Fig. CV 3 a exhibits CV for 
PN on an exceptional scale; as PCVN does not take on reasonable values 

except for very large day numbers, we includ PCV 1 N as well. ( Note : Because 

l -k 

CV (and CV ) vary, approximately and exactly respectively, like d 2 , 

where d is the day number, the asymptotes of the CV graphs are all, of 

course, the CV = .00 - line.) The CV's for K are very consistent and, 

in view of the graphs' stabilities suggest that together they constitute 

a fairly accurate determination. This, however, is only to say ’that the data 

’are rather consistent. We have previously noted that the absolute retentions 

of potassium are so high as to be meaningless and large unmeasured losses 

must occur. The retentions for calcium do not yield as consistent a picture 

and those for nitrogen are least adequately characterized. These "updated" 
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CV characteristics accord with those of the limiting confidence intervals in 
Table 4. 


< s The "Updated" Confidence Interval for Net Retention. The 
"updated confidence intervals" (equation 7) for average net retention, 
E (NET) , as a function of MBE duration, may be obtained in view of (3) 
and (4) from the CUM k and the value of a 2 (NET) in Table 1, for any 
astronaut and any substance, as 


k 

l NET 4 
i=l 

± 2 • /a 2 (NET) /k 

Equivalently , this confidence interval may be computed roughly from the 
graphs as 

(7') CUM k ± 21(01^) (CV J . - 

> . . . 

Real Trend in NET Over Time . It is possible that superposed 
onto the artificial trend is a real, biologically based trend in net 
retention as well. ‘ Such a supposition is suggested by the fact that all 
three of the CUMCa plots show a decrease as d increases, which under an 
assumption of stationarity in NET (i.e., no trend) would occur only one 
time in roughly 3 • 3 • 3 = 27 (supposing three informally distinguishable 
"graph-characters:" increasing, level and decreasing and assuming the 
other substances' NET's are known stationary; this is obviously only a 
rough calculation). 

By contrast, the apparent trends in the three subjects for N 
and for K do not appear systematic in any obvious way. 

The hypothesis that the CUMCa' s exhibit the superposition of 
a biologically-based trend in NET is supported by tne finding that 
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truncating the first 27 days from the NET Ca's and computing the 
corresponding "CUM28" , i.e., 

d 

.£ NET Ca 

CUM 2 8 Ca, = — — 

d d - 27 

, / ' ^ • 

yields a limiting level to CUM28 Ca significantly lower than that of the 
corresponding CUM Ca (see Pig. CUM28) . This suggests that the subjects 
had a higher pre-chamber retention (average NET Ca) than that in the in- 
and post-chamber phases. Biological considerations would lead to an 
a priori expectation in MBE's of the present design that, if any of N, 

Ca and K exhibit real trend, Ca would be the most likely candidate. This 
lends support to the foregoing arguments. ( Note : Because the pre-chamber 

period was often exceptional in SMEAT, this truncation technique can often 
also serve to "flatten out" the artificial trend. We do not discuss 
details of this use. The degree of "flattening" achieved by truncation 
varies from plot to plot.) 



MBOL SCALE NAME 

® #1 SCUM2SCA 

* #1 CCUM2SCA 

#1 PCUM28CA 


Fig. CUM28. Cumulative calcium retentions for the 3 
astronauts over time during the latter 
part of the study (see text) . 
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Measuring Relative Long-Term Trend . We may obtain quantitative 
information on the patterns of long-term trends as follows. The preceding 
discussion suggests the possibility of using the long-term NET K pattern 
as a "reference" and measuring NET Ca trend with respect to NET K. To 
do this, we fit the regression models 

NET Ca ~ a + b • NET K + error 

As As 

to obtain fitted coefficient a, b and thereby to obtain residuals 

RESi = NET Ca.^ - a - b • NET Ki 

We now regard the RES series as having a 6-day period and perform a two- 
way analysis of variance on RES and any patterns to the row effects, if 
"non-random, " giving us infomration on the differences between the long-term 
pattern of Ca retention from that of K. Tables AVres 1-3 show the results 
for S, C and P, respectively, of the regressions followed by the A V on 
the residuals (excluding the irregular ends of the MBE's). In all three 
cases, the row effects clearly exhibit a non-random favoring of positive 
ones for the early periods of the MBE's and negative row effects for the 
later periods. This relative trend seems strongest for S. Thus, the 
astronauts all exhibited systematic decreases in their average Ca retentions 
as their MBE's progressed, as compared to their K retentions as measured 
by the residual AV. 

Note 1 . As is the case with all three substances, any trends in 
NET K values represent the superposition of biologically based trend and 
bias due to the MBE procedure. Thus, although the K retentions are 
relatively stable (see Fig. CUM 12, remembering the end-correction required 
for SCUMK) , their numerical values seem to be even less realistic than 
those of N and Ca. If we can assume, however, that the bias due to the 
MBE procedure is constant throughout the range of the MBE considered (in our 
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Table AVres 1 


I* SNSTCA = A+3*SNEIK 


MOB = 

90 

WOVAR = 

2 



' RANGE 

= 1 

TO 90 


■ ■ 


RSQ = 

0. Qb747 

CRSQ 

= 0.0467b 

F(l/88) = 

-SER = 

447. b340 

SSR 

= 1.763E+07 

DW ( 0 ) = 1.99 

CO Er 


VALUE 


ST ER 

T-STAT 

A 


7.3.42530 

68. 14380 

1,07751 

B 

• 


0. 14 133 

0.06102 

2.31633 

TWO 

NAY 

ANALYSIS 

OF VARIANCE 


) RON 

’ SSQ 

D.F. 

MEAN SQUARE 

F(RESID) 

2. 833738 E+06 

14 

202413. 

1.1038 

COL 

1 .95 682 7 E+06 

5 

39176b. 

2.13637 

RES 

1 .2836543+0 7 

70 

133379. 


TOT 

1 .76291 /E+07 

89 



GRAND 

MEAN 

— 

0.0:001 73 


R0.1/C0L ROrt EFFECTS 

COL EFFECTS 


1 


21 1 . bO 1 


— b 1 . 4b9b 


: 2 


237.641 


-173.2 1 1 


- 3 


-219.343 


-4.04297 


4 


311.832 


278.503 


5 


163.497 


-122.667 


6 


47.6747 


72.8774 



b • 365 


PROBABILITY 

0.370066 

0.071094 


7 i 46.462b 

8 ! Iib7. 08 d 

9 -174 . 4b 3 

10 I- 1120 .016 

11 - 1 3b . 987 

12. 1 -2.93294 

13 -2bd.b87 

14 -59.5543 

lb . -124.73 d 


I 
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Table AVres 2 


1* CNETCA = A+£i*CNETK* 


NOB = 90 NO VAR = 2 
RAnGE = 1 TO 90 

RSQ = 0. 2d 836 CRSQ 

; SER = 429.7430 SSR « 


0.25043 
l . 625E+07 


r( 1/88) = 30.735 

0 . 1 ( 0 > .= 2.21 


VALUE 


ST ER 


T-STAT 


-52.40340 

0.37637 


59.43460 

0.06771 


-0.83170 

5.54396 


Tif.'O nay analysis of VARIANCE 


SS Q D • F . 
1 . I 34795= +06 14 
167273. 5 
1.489972E+07 70 
1 *625 1 79E+07 89 


MEAN SQUARE 
84626.2 
33454.6 
2 1 28d3. 


F(RESID) 

0.39769 

0.157172 


PROBABILITY 
0.97 119 
0. 977 I 66 


GRAND MEAN = 0, 

ROh/COL RON EFFECTS 

1 62.6124 

2 ■ ' : 27.3992 

3 110.821 

4 13.7146 

5 : -77.9852 

6, ' 7 96.7691 

7 I 108.696 

. 8 5.38876 

9 -124.364 

10 j -13.2777 

11 -175.534 

12 139.167 

13 -187.011 

14 171. 008 

T 1 5 -151.614 


0.000135 


COL EFFECTS 
-25.2339 
-83.3437 
47.34 1 1 
13.6332 
25 .864d 
21 .7388 
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Table AVres 3 


1* PnETCA = A+6*PNETK 

NOB = 90 NO VAR = 2 
RANGE - I TO 90 ' 

RSQ = 0.27994 CRSQ - 0.27176 FC1/36) - 34.212 

SEP = 327.2900 SSR = 9.426E+06 Dn(O) = 2.29 


COEF 

VALUE 

ST ER 

T-STAT 

A 

-46. 16990 • 

51.84700 

-0.89060 

B 

0.35349 

0.06129 

5.64913 


TWO 

WAY ANALYSIS 

OF VARIANCE 



■ 

SSQ 

D.F. 

MEAN SQUARE 

F (RE3ID ) 

PROBABILITY 

Pow 

1 .050636E+06 

14 

75049. 

0. 666759 

0.798261 

COL 

496734. 

5 

99346.8 

0.882628 

0.497449 

RES 

7 . 879057E+06 

70 

112558. 



TOT 

9.426478E+06 

89 




GRAND 

MEAN — 

0.000197 


' . . 


' ROri/COL RON EFFECTS COL EFFECTS 

1 -41.742b -116.232 

2 8b. 660b -4b. 60 11 

3 -195.755 98.4011 

4 103.374 -8.3369b 

5 239.207 - 15.0303 

6 bb. 6263 86.8712 

7 36.5577 

8 10. 1908 

9 51.3806 

10 -27.561 

11 -146.935 

12 -153.013 

13 -62.4802 

14 -8.34354 

15 r 53.3137 
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instances, days 1 through 90), then it is easy to see that the procedure 
proposed here is equivalent to comparing NET Ca with the biologically 
based part of NET K alone, as follows. If we let A equal the 
constant bias, so that 


NET K£ = NET KBi + A 

where NET KB^ is of course the biologically based portion of NET K 

th * 

on the i day, then it may be verified that 

A A A 

a fi = a + b A 

A A 

b fi = b 

A /S 

where a B , b B are the coefficients in a regression of NET Ca on NET KB. 
Thus, if 

RES B i = NET Cai - a B - b B NET KB ± 


we have 

RES B = NET Ca - (a + b A) - b (NET K - A) = RES, 

so that the residual with respect to NET K and with respect to NET KB 
are the same, as we set out to show. 

Role in Future MBE Analysis . This technique, when used in an 
"updated" manner as an MBE progresses, and making comparisons among several 
substances at least some of whose trend characters are approximately 
predictable in advance, may allow inferences about evolving trend in an 
MBE even before all substances have stabilized, i.e. , have zero trend, 
in their NET'S. 
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We unfortunately must conclude that however accurately metabolic 
balance studies are conducted they will inevitably lead to uncertain 
results. The usual experience is that the more nutrient fed the more 

L l*... 

"apparent retention" is obtained and that the results are not made more 
interpretable by very long-term studies. Since the reasons for the "errors" 
are unknown (even though skin losses seem the most logical explanation) , 
comparative studies with different environmental conditions or different 
subjects must be viewed with great caution and suspicion. Calcium balances 
are particularly difficult to interpret. Adapation to new conditions may 
require months and the losses or gains in total body calcium which appear 
to occur, as in SMEAT, may be insignificant in terms of the total body 
supply. Clearly, one cannot project losses observed over even a few months 
to long periods. 

Studies utilizing activation analysis for estimation of total 

body calcium or nitrogen or the measurement of total body potassium by 

** °K counts or the estimation of total potassium pools appear to offer 

the best direction for research. Even though the accuracy of such measures 

is difficult to establish, they should be satisfactory for comparative 

measurements, before and after experimental treatments, as in the exposure 
to weightless conditions. Until such independent measures of total body 

composition are available, the interpretation of balance studies will remain 

questionable. 
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APPENDIX I 


Periodicity and Correlation Structures in the Data Series 

Motivation . The rather substantial CV's, even the long-term 

. { 

CV's, move us to consider whether any systematic structure within or 
between the 27 time series for I, U and F summarized in Table 1 in Section I 
can allow reducing the 0 as displayed; Table 1 directs our attention in 
particular to the urinary nitrogen, urinary potassium and fecal calcium 
which contribute most to their respective a (NET) . 

Character of Finding . A naive first hypothesis is that what is 
excreted in the urine or feces is a function of the dietary input n days 
before. We have explored the data with such hypotheses in mind, attempting 
to predict F£ from its precursors, etc. 

Our principal finding is that such variance reduction is not 
available to any significant degree, and in this sense the estimates in 
Table 1 are realistic characteristics of MBE procedures. We found the 
interdependencies too weak compared to the inherent variabilities of the 
system. 

A major feature of the data that hindered the present inquiry 
was the highly irregular quality inherent in the F series. Although U 
can be presumed to be coming out associated with the I of hours before, 

F can be delayed by a variable number of days that need bear no relation 
to the diet cycle. * 

The hour of last urination before collection, as another data 
variable, could allow adjusting by some partitioning scheme for the 
"collection effect;" this could be especially important for N and K and 
other substances where the primary excretory channel is urine. 
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Significant Correlations . Table AI.l summarizes the significant 

correlations among the input and output series . "VOL" denotes the daily 
urine volume. (S = SPT, C = CDR, P = PLT in this Appendix) SPTUN (-3) 
denotes the "3-days lagged" SPTUN series. 

The W5-series denotes smoothed fecal series by the triangular 
moving average, 

W5Fi =0.05 F i _ 2 +0.25 F i _ 1 + 0.40 F ± + 0.25 F ±+1 +0.05 F ±+2 

We do not find any significant correlations for any of the three 
astronauts of the fecal series, thus smoothed, with either the input or 
the urine series, although there seems to be a trace of covariation with 
some lagged values of the variable, 

GUT = I - U 

which measures how much of the diet input is not excreted by the urine 
on a given day. 

The less-than full ranges in Table AI.l were chosen so as to 
exclude clearly aberrant end-values of the MBE's. 

The high correlations with VOL are somewhat surprising, but this 
is not a useful issue for us to discuss so we are simply documenting 
the results. 

The variables with the "SQ (ueezed) "-prefix denote the fecal 
series with the O's deleted; these are the appropriate measures of 
association of substances' fecal measurements , because the exact-zero 
entries inflate the correlation "artificially." The high contemporaneous 
correlations among the SQF's are not surprising as the feces can be 
expected to be fairly homogeneous on a macroscopic scale with respect to 
the several substances. 
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TABLE AI.l 

Significant Correlations 


Variable 

Variable 

Range 

Correlation 

SPTFCa 

SPTFK 

25 

- 

78 

0.851 



1 

- 

101 

0.791 

SPTUCa 

SPTVOL 

1 

- 

101 

0.937 

SPTUN 

SPTVOL 

18 

- 

83 

- 0.460 

SPTUCA 

SPTUN (-3) 

20 

-- 

83 

0.223 

SPTUN 

SPTUN (-1) 

20 

- 

83 

0.203 

SPTUCa 

SPTUK 

20 

- 

83 

0.512 

SPTUCa 

SPTUK (-1) 

20 

- 

83 

- 0.239 

CDRUCa 

CDRUCa (-2) 

6 

- 

102 

0.261 

CDRUCa 

CDRUCa (-4) 

6 

- 

102 

0.215 

SPTUCa 

SPTUK 

25 

- 

78 

0.443 

CDRFCa 

CDRFK 

1 

- 

102 

0.952 

SQCFCa 

SQCFK 

1 

- 

64 

0.899 

SQPFCa 

SQPFK 

1 

- 

87 

0.838 

SQSFCa 

SQCFCa (-4) 

6 

- 

84 

0.255 

SQPFCa 

SQPFCa (-2) 

6 

- 

87 

0.270 


(-3) 




0.279 

SQSFCa 

SQSFK 

.1 

- 

84 

0.716 



15 

- 

70 

0.754 : - 

CDRFN 

CDRFCa 

1 

- 

70 

0.922 

CDRUCa (+3) 

CDRUCa (-3) 

24 

- 

80 

0.409 


(-2) 



- 

- 0.376 

PLTUCa (+3) 

PLTUCa 

4 

- 

86 

- 0.311 


(-3) 




0.257 

SPTUCa 

SPTVOL 

1 

- 

101 

0.626 

CDRUN 

CDRVOL 

1 

- 

102 

0.229 

CDRUCa 





0.539 

PLTUN 

PLTVOL 




0.371 

PLTUCa 





0.373 

SPTUK 

SPTVOL 

1 

- 

101 

0.451 

CDRUK 

CDRVOL 

1 

- 

102 

0.579 

PLTUK 

PLTVOL 




0.450 

SGUTCa 

W5SFCa (+2) 

3 

- 

95 

0.221 

CGUTCa 

W5CFCa (+1) 

3 

- 

96 

0.114 

PGUTCa 

W5PFCa 




0.230 


(+1) 





0.114 


(+4) 

3 


96 

0.149 

The highly 

varying non- 

-gaussian 

character 

•*« M< h- -¥ 

of the fecal series, 

coupled with the sense that F^ 

represents 

the I 

from a variable' number 


of days before, suggests that some non-parametric technique might be helpful 
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We have considered the series 

FRI-l = rank (F^) in the set {Fj}j 


maxiura day no. 
=1 


where all zeros are given a rank of 1, so that for example 

4 i 

SFCa 20 " SFCa 35 

shown in Table raw, are replaced by their ranks as in Table ranks. The 

correlation matrix of the rank-series SFCaRl is shown in Table corr; 
it is suggestive but again the magnitudes are too small to be useful. 

Spectral Analyses — Background and An Example . A more delicate 

technique than autocorrelations for distinguishing periodicities in a 

time series is spectral analysis. Spectral analysis regards a time 

series as being the superposition of sinusoidal waves over a continuum 

of frequencies; a graph of the "spectrum" versus frequency (in cycles/day) 

exhibits the magnitude of the contribution of these harmonics at each • 

frequency to the total variance of the series. Fig. 2 is the (logarithm 

of) the spectrum versus frequency (which we refer to as the "spectral 

plot") for a series with a known strong, but not perfect, periodic 

component with 

1 (diet) cycle 

frequency = ^ days = ^*167, namely 

the series SPTIN shown in Fig. 1. (We exclude the clearly irregular 
final two periods from the analysis.) 

Notice the peak centered at about 0.165,. reflecting the diet 
periodicity of 6 days. Notice further that the spectral plot "tunes in" 
to a 3-day cycle as well (1 cycle/3 days = 0.333) as shown by the 
r peak centered at about 0.33; a closer examination of Fig. 3 reveals 
this more subtle feature which was an unintentional feature of the 


SPTFCA 


■SFCArtl, 


Table raw 


20 

■ i’ 0 . 

0. 

0. 

143.92 

24 

634.42 

256.4 

91.6 

344.47 

20 

294.25 

920.64 

692.37 

282.45 

32 

813. 12 

589.92 

310.84 

10S9. 27 


Table ranks 


20 

24 

•u 

32 


1. 

50 . 

12 , 

59 . 


1 . 

9 . 

67. 

36. 

Table corr 


1 . 

3. 

42. 

14 . 


5. 

15. 

11 . 

75. 
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EQUATION: 

sf car 1. 

(-6)); 
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ARG2 
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ARG4 

ARG5 
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1. 000 






ARG2 
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ARG3 

.01.16 7 
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' 1 . Oil 





ARG4 

-94 12 4 
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1.0*0 



km 

0.013 
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experimental design. 

The sensitivity of the spectral plot to periodicity is 
illustrated by Fig. 3 which shows SPTIN plotted against SPTIN(-3); if 

f r 

the 3-day cycle were very strong, this plot would appear as a very flat 
elliptical point-cloud — but in fact it departs little from circularity 
(the stray values come from the end of the experiment which was excluded 
from the spectral analysis) . Yet the spectral plot tunes in to this 
weak periodicity with period of 3 days. 

For those readers familiar with the technical aspects of 
computing spectral plots, we include with each of our spectral plots 
some documentation regarding the particular options used in each case: 
type of prewhitening (if any), parameter for the "fast Fournier transform" 
(FFTPARAMETER) , etc., as well as the series 1 variance in principle, but 
sometimes differing a small amount for numerical reasons). Further 
details of the algorithms used to produce these plots are available on 
request. 

A Remark on MBE Design . In an MBE with deliberately highly 
varying dietary contributions from a particular mineral, it is important 
to investigate before the experiment the harmonic structure within the 
diet cycle, lest information be obscured through having periodicities 
with cycle-lengths that are multiples of each other, making it impossible 
to distinguish which proposed effect, in fact, drives the output. It 
might, in fact, be useful to investigate an MBE which was deliberately 
designed to allow variability in intake in order to more fully examine 
'these relationships. 
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Specific Findings . Fig. 4 shows SPTUN rising slightly in 
the final periods; its spectrum (in Fig. 5, over the full range of 
SPTUN (1-101) unfortunately gives no support to the hypothesis that 
the urine recapitulates the cyclicalities of the input. The dietary 
loadings were apparently too nearly constant to hope for that. Not 
surprisingly, the more irregular SPTFN (Fig. 5a) has a spectral plot 
of inconclusive shape as well. The. situation is the same in character 
for CDR's and Pl/T's U and F plots as well. The SPTICa series, as 
might be anticipated because they both arise from the same diet, looks 
the same in character as the SPTIN series. The other diet (I) series 
does also. 
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U Spectral Plots . The spectrum of SPTUCa exhibits peaks at 
frequencies without any obvious interpretation (Fig. 6) in the context 
of the experiment; CDRUCa's spectrum (Fig. 7) shows a 2-day peak and does 
seem to recaptulate weakly the 3- and 6-day cycles in CDRICa. PLTUCa's 
spectrum (Fig. 8) seems to share the same general character although here 
periods seem closer to and 7 days (peaks at 0.29 and 0.14)' — suggesting 
that effects with a weekly or biweekly period crept into the experimental 
procedure. While we do not have any facts about the experiment to settle 
this issue, it would not be unusual for this to have occurred. 

We include the spectral plots of the three astronaut's daily 
urine volumes because these plots are surprisingly well defined in 
character; they apparently reflect a sensitivity to the input cyclicality, 
some biological rhythms, and artifacts of the experimental collection 
procedures in ways that might warrant further study, although such a 
treatment would be a rather technical digression for the present paper, 
involving exploring, e.g. , the interpretations in the experimental context 
of the possible "aliases" of the peak-frequencies. SPTVOL (shown in Fig. 9) 
while apparently random to the unaided eye, turns up a well defined 6-day 
peak in its spectrum (Fig. 10). CDRUCa's spectrum (Fig. 11) shows a clear 
3-day peak; PLTUCa's spectrum (Fig. 12) shows peaks at 6-7, 4, and 3 days 
per cycle. 

Urinary Ca, Urinary K Cross- Spectral Analysis . In a cross- spectral 
analysis of SPTUCa and SPTUK, the "coherence" , which measures the degree to 
which the two series are "in step" at different frequencies, turned out as in 

f . 

Fig. 13 — several peaks with no obvious interpretation and a peak at nearly a 2-day 
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cycle. This may reflect either shared biological rhythms, or an artifact 
of the collection procedure (because what was in the bladder at collection 

time "will be counted tomorrow even though it belongs to today") , or both. 

. f 

Fig. 14 shows the "gain of SPTUCa to SPTUK, which may be thought of as 
a regression coefficient of the periodic component in SPTUK on that with 
the same frequency in SPTUCa. The gain plot shows SPTUCa' s harmonics to 
be poor predictors, i.e., weakly correlated) with the corresponding harmonics 
Of SPTUK — except at a biweekly cycle (0.28 ZSz 2/7): tracking down this 
artifact to its source in the experimental procedure could lead to an 
improvement in accuracy. In this way spectral analysis, because of its 
sensitivity, can serve as a "troubleshooter" in the experimental procedure, 
i.e., as an exploratory diagnostic rather than its usual role as a tool 
of confirmatory data analysis. 

Fecal Series . Fig. 15 shows SPTFCa with its highly variable 
entries caused in part by the apparently random o'ccurrences of zeros 
(arising on days with no bowel movements). Fig. 16 shows SPTFCa’ s 
spectral plot (days 13 to 95); the peak at frequencies within 0.40-0.50 
are probably generated by experimental artifacts. There seems to be a 
6-7 day peak and perhaps a 3-day peak also; but these patterns are not 
articulated clearly in the spectra of PLTFCa or CDRFCa, nor are they 
strong enough to produce significantly non-zero autocorrelations of the 
corresponding orders; thus any periodic components' to the FCA series 
seems too weak to exploit for any variance-reduction purposes. It will 
be recalled from Table 1 of the text that the F series of primary concern 
are those for Ca, for which the primary excretory channel is the feces 


'Hstesswasa 


=4 
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rather than the urine, and hence 


o 2 (f) > > a 2 (u) 


Examination of the F series resulting from omitting all zeros, 

•r 

, 4 

i.e.» making the resulting series shorter by the number of zero days. 


including ANOVA's with appropriately shortened periods, e.g., the shortened 


84 5 . 

SPTFCa series is 84 days m length, and t-tt- 25S — implying an adjusted 

iUl 6 


period of 5 days), does not turn up any stronger patterns. 

i • Fecal-Urine Relationships . The character of our findings 


relating the F and U series is the same as our intra-F and intra-U 
findings; weak patterns and correlations can be ferreted out but not 
strong enough one's exist to allow any significant variance reductions. 

A cross-spectral analysis of SPTFCa and SPTUCa, for example, 
shows that their 6 -day cycles exhibit a trace of being "in step," but are 
overall undistinctive in character {Fig. 17). The same applies to the 3 
and the 5-day cycles of SPTFK and SPTUK {Fig. 18)'. The peak at 0.45 
cycle/day in these two coherence plots recurs in several other plots, 
suggesting that one of its aliases {n + 0.45, n + 0.55, n >. 0, an 
integer) may correspond to an experimental artifact that should be’ 
tracked down. For example, the hypothesis that this peak represents 
an effect caused by the length of the urine collection cycle would 
lead.,to computing the quantities •• 


24 ' 


24 


hours 


n + 0.45 ' n + 0.55 cycle 

n an integer >, 0. The most reasonable such values would seem to correspond 
* to n = 2, approximately a 10 hour cycle. We do not have the requisite 
information to settle the matter but wish to call attention again to the 
ways that spectral and cross-spectral analyses can serve as exploratory 
tools in MBE design. 




Fig. 19 shows the coherence of the F and U series for CRD's 

Ca excretion. Notice again the peak at 0.44, the weak 5-6 day peak, the 

2-day peak (probably partly an artifact both of the analysis and also of 

. r 

the experimental collection times being at fixed times) , and the peak at 
about 0.11, any of a number of whose aliases could arise for a variety of 


reasons. 
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Estimates of Measurement Errors' Contribution to <3 (NET) 

Our Source Material . This appendix makes use of additional data not 
mentioned in the text and refers to the Measurement Document by Arnold [1]. 

He proposed a model for combining the means and variances of certain 
variables of the experiment so as to yield numbers representing the mean 
and variance of the net substance change per day for any astronaut. 

There were seven "substances" — 6 minerals and nitrogen (N) . Bot input 
and output involved a number of parameters. This appendix considers 
only nitrogen (N) and calcium (Ca) . Measurement tables yielding quantities 
relevant to the model were available to us. Those we used are referenced. 

Their acronyms for our convenience in referencing are also included. 

What We Did . We have done what we believe can be regarded as a 
representative calculation. We chose to particularize our calculation 
to the case of 

Cmdr . Crippen 
Day 2 

Nitrogen and Calcium 

only. 

The claim that these calculations are representative is one we do 
not substantiate in detail here, and indeed going through the calculation 
for other substances, days, and men would be necessary to confirm this 
firmly. Our conclucion is a sufficiently decisive one as to the small 
contribution of measurement error to the total variance that we do not 
regard this as a crucial matter. 

We have simplified Arnold's input model and changed his output 
model so as to allow using the data in our possession. This involved our making 


V 
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reasonable estimates (described below) for a few necessary parameters 
based in one way or another on the data. 

Our Plan in this Note . We shall state our final conclusion first, 

, f 

then go to particulars. The experiment, and hence the model, had three 
"phases" — input, urine and feces. For each phase we describe our model, 
how we obtain the parameter values necessary for the model, and identify 
any assumptions and what they are based on as they are needed. 

It will turn, out that where we had to make reasonable approximations, 
the outcome will show itself not to turn on the particular values we 
obtained thereby. Finally, we do the actual arithmetic to get the mean 
and variance of the substance — input or output for each phase. 

We perform the entire sequence of calculations for each of N 
and Ca in turn. The following sections relate to the N findings. 
principal Conclusions 

Statements . We found the magnitudes in the measurement uncertainties 

associated with each phase to be in the proportions approximately of 

input: feces: urine , 

r 1 : 10:100 

Moreover, the analysis showed that almost all this ± 60% has its source 
in a single measurement — the concentration of substance in the urine . 

Confidence in the Findings . We return to these three points at the 
conclusion, but these are essentially the reasons we regard our findings 
as trustworthy. 

i) it confirms quantitatively the intuition of experts 

ii) the consistency in relative and absolute orders-of magnitude 

of our derived-from-the-data parameters and the pre-experiment 
Arnold [1] values, making us feel that our estimates based on 
the data represented a refinement of the representative values 
to the particular experimental procedure used in the MBE. 

iii) as an internal confirmation, we shall show moreover that any 
approximations we are obliged to make occur in a role 
influencing only negligibly the final answers. 
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Input 

Model Variables . Conforming to Arnold's [1] (p.59) notation, we 
define, for a particular food the random variables: 

P = substance (N in this catenation) content of portion 

A = substance content of standard size portion 

C = ratio of true weight of portion to weight 
of standard portion 

D = weight or residue when portion is judged 
completely consumed 

We find it convenient to introduce the symbols (not appearing in Arnold [1]) 

H = total (rehydrated) weight of standard portion 
(In Arnold's notation, H = S + K • W, but this definition and the form in 
which the referenced tables are given allow us to avoid any direct mention 
of these variables on the right hand side.) 

More imput variables (Mi,M 2 ) are given in Arnold (p.59), but we 
make an assumption that removes any necessity to consider them. 

Assumption. All foods were completely consumed. 

Basis for Assumption. 

(1) NASA has asserted to us that in all three Skylab missions only 

about 10 times did it occur that a food was reported as not being completely 
consumed . . 

(2) In retrospect, given our results, this assumption cannot change 
the character of the conclusions from our numbers . 

Relations Among Parameters . We shall find it useful (as does Arnold, 
p. 12) that if X,Y are independent random variables, then, 

(i) E(X-Y) = E (X) • E (Y) and 

(ii) a 2 (X-Y) = [a 2 (x) + e 2 (x)] • [a 2 (Y) +e 2 (y)3 - e 2 (x) e 2 (y) 
which may be rewritten as 

— cr 2 (x) + e 2 (x)1 (o 2 ( y)) + a 2 (x) e(y) 
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Note . 'Proof of (ii) . (ii) follows from (i) , the definition of variance 
for any random variable Z: 

a 2 (Z) = E (X 2 ) - E 2 <Z) , 

' i* 

and the fact that X,Y independent implies 


E(X 2 Y 2 ) - E (X 2 ) E(Y 2 ) . 

Remark on our presentation. Subject to our modifications, our 

references to Arnold [1] should be construed as for brevity's sake. We 

are not leaning on the prior document in any substantive way. Rather, we 

have reconsidered independently the most sensible way to model the input 

and find that it overlaps sufficiently with the previous model that will 

save space and be clearer if we refer to it. The way we obtain particular 

numerical values for some of these parameters will be entirely independent 

of Arnold. At the end we compare. 

Explicit formulas of the model. We define as does Arnold 

B = ratio of weight of portion consumed to weight of 
standard portion 

We shall find it convenient to introduce the notation R = ~ . R is the 

H 

fraction of the total portion weight that is residue (when the portion 
is judged completely consumed) . We take, as does Arnold, H to be 
non- random, so that 


E(R) 


E(D) 

H 


We have 


and 


B = C - R (Arnold, p.ll) 

P = A * B (Arnold, p.10) 

so by (i) and (ii) therefore, the expected value of the intake is 
(1) E (P) = E (A) • (E (C) - E (R) ] 
and its variance is 


( 2 ) 
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[O- 2 (A) + E 2 (A) ] a 2 (C) J / E 2 (A) ' • a 2 (C) 

=(+ [a 2 (A) + E 2 (A)]« a 2 (R)^ -(+ E 2 (A) * a 2 (r) 

+ a 2 (A) • [E (C) - E(R)] 2 / ( + e 2 (c) • a 2 (A) 


We suppress subscripts throughout, but it must be remembered that each 
variable is associated with one particular food portion , i.e., (1) and (2) 
give the amount and variance of substance intake for each portion. To 
find the amount and variance of total intake for a particular day, we add 
those for each portion in the day’s diet. Thus, the problem reduces to 
finding values for each food of the parameters on the right hand dise of 
(1) and (2). We now say how we arrived at values for each of these. 

Estimating Parameters . 

Parameter: E(C) 

A 

Assumption: E(C) =1 

Basis: (1) The standard weights were asserted to us to have 
been "unbiased," i.e., rdughly as likely to be about as much above or 
below the true portion weights for any portion. (2) Same as (2) of 

assumption on page 3 of this Appendix. 

2 

Parameter: 0 (C) 

A 

Assumption: 0(C) = 0.01 for all foods. 

Basis: We were told in personal conversation that NASA 

arbitrarily adopted the standard of keeping each food's actual weight of 
standard portion to within ± 2 to 2.5% 6f the specified standard portion 
weight. (Examining preliminary data tells us there may have been a few 
exceptions to this.) But taking 2% = 0.02 to be 2 standard deviations 
(20), we are led to the assumed estimate below. 


1 
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Parameter: E (A) . E (A) may be read directly from the 9 sample 

table under the column headed MEAN, in the row for nitrogen-- one value for 
each food. 

■ i* Parameter: a? { A), a 2 (A) comes from the same row of the same 

table, as the square of the item in the column STD DEV. We have found 
it most natural to obtain directly from the data the mean and variance 
of the parameter R, as follows: 

Parameters: E{R), CT 2 (R) . To avoid unnecessary calculation and 

to deal with the fact that some foods occur so seldom in the diet that 
individual values of E (R) and a 2 (R) could not be based upon a satisfactorily 
large sample, we make the following assumption. 

Assumption: Any two foods contained in the same container-type 

(can) share the same values of E (R) and a 2 (R) . 

Basis: (1) An examination of Table 9 in Arnold would show that 

the table was prepared under the same assumption. This is in accord with 

reasonable common sense. (2) Retrospective order-of -magnitude examination 

of the model equation for the input variance (a 2 (P) ) will show the variable 

E(R) to be the least important variable affecting the final variance-of-net- 

change of any quantity in the model . Its influence on the expected value 

of the net change will be shown to be inconsequential as well. (3) Examina- 

tion of actual data values comparing observed E(R), cr (R)'s for different 

foods in the same type container from the residue table showed that this 

assumption held to high accuracy. 

How does this assumption work out in practice? Examining Table 9 in 
Arnold shows the counts for number of foods for each can type (Chart 1) . 
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Chart 1. 

TYPE 

I 

II 

III 

IV 

V 

VI 

VII 

VIII 

IX 

X 

XI 

XII 


Number of Foods in Diet in Each Container Type 

NUMBER OF FOODS 

2 

2 

1 

5 

16 

4 

26 

11 

1 

1 

0 

1 


Assumption: For foods in container of Types I, IV, VI, IX and X. 

E(R) = 0, a 2 (R) = 0 

Basis: Examining the residue table shows these foods, being 

generally "hard" non-viscous substances, to have vanishingly small residues, 
which we attribute to the measurement procedure rather than to food really 
leftover. Examining Chart 1 shows that most foods were in containers 
either of Types V, VII or VIII. Thus, there are ample observations in 

A A n 

the residue table from which to obtain E (R) and O (R) directly from 
the data for each of these three can types. 

Our method was to pick 20 foods (informally) at random from 
Crippen's in-chamber days, allowing for the possibility that E(R), cr 2 (R) 
might be different for different astronauts. Columns headed "residue" 
and "total" are -values of D and H, respectively. We divided the D for 
each of our sample portions by its associated H to find R^, say. 

We then set 


E (R) = 


_ 1 _ 

20 


20 

R. 

1=1 l 


R 


and 


8 


? i 20 — 2 

cr 2 (R> - l (R r R) 2 
20 i=1 

We thought 20 foods would be a large enough sample to obtain accurate enough 

A An 

values for E (R) , O (R) . Chart 2 shows the calculation for can type VII 


with the decimal point moved over for convenience in the last column. 

Note . Here, as in all the charts in this Appendix, we include an 
exaggerated number of significant figures in our numbers. This is because 


we regard these quantities as intermediates toward the computation of E 
and a 2 for NET, and it is standard numerical-analytic common sense to round 
off at the end, not before. 


For other less common can types, observations were harder to come by. 


This is not a flaw to be Worried over, however, if we remember that it 
is exactly these foods that occur less often in the diet so their 


parameter values entered less often. Can types III and XII occurred so 
rarely that it was not feasible to keep their parameters astronaut-specific. 


TOTALS 


Chart 2. 

Crippen (in- 

-chamber values) 

Can Type VII 



D 

/.D E (D)\ 

D 

H 

H 

\H H / 

0.79 

68.4 

0.0115 

8 

1.07 

124.5 

0.0086 

1011 

0.22 

75.0 

0.0029 

7885 

0.42 

124.5 

0.0034 

7022 

0.84 

171.8 

0.0049 

4733 

6.28 

110.5 

0.0568 

202680 

2.18 

172.5 

0.0126 

67 

0.00 

167.6 

0.0000 

13924 

3.08 

199.0 

0.0155 

1384 

2.35 

156.0 

0.0151 

‘ 1102 

1.38 

68.4 

0.0202 

7089 

1.17 

124.5 

0.0094 

576 

0.02 

75.0 

0.0003 

13225 

1.50 

196.5 

0.0076 

1764 

2.09 

172.5 

0.0121 

9 

1.37 

110.5 

0.0124 

36 

1.67 

124.5 

0.0134 

256 

1.05 

199.2 

0.0053 

4225 

0.91 

183.0 

0.0050 

4624 



0.2356 

276244 

A 


" s 

A o 


E(D) 

Tt 

= 0.01178 

* D) - 13812 • 10 -8 
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Chart 3 shows the values of E (R) and 0 (R) computed by this scheme, and 
the number of observations in the sample for each can type. 


This completes our specification of the input calculation. 


/’Chart 3. 

n = Number 

of Foods 

Used to Compute 

Statistics 


CAN TYPE 

10 2 -E(R) 

(l-E(R) ) 

(l-E(R)) 2 10 4 -(O 2 (R)) 

n 

II 

2.090 

0.9791 

0.9586 

5.79 

8 

III 

0.424 

0.9958 

0.9916 

0.0321 

5 

V 

0.772 

0.9923 

0.9847 

0.1571 

20 

VII 

1.178 

0.9882 

0.9765 

1.381 

20 

VIII 

2.269 

0.9773 

0.9551 

1.777 

20 

XII 

2.380 

0.9762 

0.9530 

5.08 

9 


Can types I, IV, .VI, IX, X: 

E (R) = a 2 (R) HO. 

Model Calcuations . Having specified all parameters and described 
how we arrived at a numerical value of each, we now carry out explicitly 
the arithmetic dictated by the model calcuations. Recall again that we 
are -focusing on day 2 of Cmd. Crippen (See Arnold, p,53 for this diet 
menu on that day) . 

Chart 4 exhibits the contributions of each portion on day 2 to 
the total input amount (E(P)) and variance (a 2 (P) ) both according to "term 
of the sunt" in the 0 2 (P) equation and "food." Chart 4 and the arithmetic 
summing totals for each column shows several findings. 

Findings . 

A 

1. Referring to Chart 3 for the values of E (R) and its square, 
we see from the closeness of the values for different can types that a 
uniform value for all types would have sufficed in the E (P) column. 

2. A small fraction of the foods contribute almost all the N, 
and these same foods contribute almost all the uncertainty in N. 


Chart 4. 

Contributions of 

Each 

Food in Crippen's 

Day 2 

Diet to E (P) [eqn. 

(1 ) , p.4l, and to Each 

of 


“the Three Terms 

[eqn. 

(2), p.5J for 0 2 (P) 

, the 

Latter Shown x io 6 

to Avoid a Clutter of 

Zeros 

Food No. 

E(P) 10 6 

x [a 

2 (A)+E 2 (A)]*a 2 (C) 

10 6 x 

[a 2 (A)+E Z (A)] 

10 6 x g 2 (A) • [E(C) - 

E(D) 2 
H J 

38 

0 


0 


0 

0 


16 

2.2175 


504.6 


696.85 

9570.68 


13 

2.5249 


653.4 


902.35’ 

5492.81 


8 

0.0301 


0.7 


1,24 

0.96 


8 

0.0801 


0.7 


1.24 

.0.96 


62 

0.0723 


0.5 


0.89 

0 


62 

0.0723 


0.5 


0.89 

0 


71 

4.9074 


2466.1 


3405.68 

47.85 


65 

0.5583 


31.9 


44.05 

249.98 


47 

0 


0 


0 

D 


19 

0.5580 


31.7 


4.98 

15.75 


42 

0 


0 


0 

0 


42 

0 


0 


0 

0 


62 

0.0723 


0.5 


0.89 

0 


39 

3.7400 


1420.9 


223.22 

3908.27 


29 

0.5089 


26.5 


36.60 

24.41 


69 

0. 3666 


13.8 


19.06 

316.39 


8 

0.0801 


0.7 


1.24 

0.96 


66 

0 


0 


0 

0 


66 

0 


0 


0 

0 


62 

0.0723 


0.5 


0.89 

0 


62 

0.0723 


0.5 


0.89 

0 


62 

0.0723 


0.5 


0.89 

0 


4 

0 


0 


0 

0 


42 

0 


0 


0 

0 


42 

0 


0 


0 

0 


60 

0 


0 


0 

0 


TOTALS 

16.056 


5744.5 


5425.56 

35008.01 



Thus , 

E (P) = 16.056 

CJ 2 (P) = (5.7445 + 5.42556 + 25.00801 x 10“ 3 = 0.046178 g 
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Implication of (2) . Some preliminary experiments could have been 
done to see which foods contribute negligibly to each substance's total, 
thus reducing the number of measurements that have had to have been carried 
along through the experiments themselves, for N at least . 

Output — Feces 

Model Variables . We define : 

Nj = g of nitrogen/fecal sample 
p = percent of sample that is N 
f = g of feces in sample 

Our model uses somewhat differently defined variables than in Arnold 

i 

although it is the same in concept, namely that 

(*) total substance = ( fraction of sample\ • (size of sample) 

\that is substance / 


Thus, we have 


N f = p • 


f 


Relations Among Parameters . We need not involve equation (i) to 
obtain E(N f ) because this quantity may be read directly from the daily 
means table in the N column for each day. Using equation (ii) implies 


(5) a 2 (N f ) = a 2 (p) a 2 (f) + a 2 ( P ) e 2 (f ) + e 2 (p) a 2 (f) 

Estimating Parameters . We now tell how we obtained estimates of each 
quantity on the right hand side of (5) 

Parameter: E(p). E(p) was obtained directly from the p-% 

duplicates table simply by averaging both members of every pair for many 
pairs spread over all three phases of SMEAT (pre-, in-, and post-chamber). 
Averaging a sample of about one-third the total number of observations in 
the table, we find E(p) - 4.204058. 


1 


i 
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Parameter: o 2 (p) . We may estimate this because values in the 

p-% duplicate table are in duplicate. 

If p^, Pi 2 are the random variables representing the duplicate 

determinations of p on the ith sample, or the actual sample values 

depending on our context, and if , 

Pi + Pi 

p = — — l ~ 

* 2 

then 

a 2 {p ii ) = aZ (Pi 2 ) = E(p ±i ) - E(Pi i ) 2 
Because E(p^ ) = p^ , we may take as the sample estimate 

■*•1 l 

„ Pi + Pi 

a 2 (Pi) = (Pi ) — ^->2 


which is equal to 

(6) teij, + p i 2 ) 2 

4 

t 

For a set of samples with sample numbers {i} over which a 2 (p) is 

^2 

constant, an appropriate value for a (p) would therefore be the average 
of quantities (6) over all indices in the set {i}. 

Thus, if we were guaranteed that a 2 (p) would not change over our 
entire range of samples, we would take the differences between members 
of every duplicate pair, square them and divide by 4, and the average of 
all these would be our estimate of a 2 (p) . Are we justified in assuming 
a 2 (p) constant over the entire range of sample numbers in the p-% duplicates 
table? Examining the data shows we definitely are not. Taking subsamples 
from three locations in the table reveals the sample variance estimates 
obtained by the above scheme in each section as in Chart 5. 



42-57 

13 

0.00166 

81-96 

14 

0.00165 

222-233 * 

10 

0.01580 


Thus, for a reason we do not know nor find a need for this analysis, 
the variability in the highest sample numbers is an order-of-magnitude 
greater than that in the rest of the measurements. 

Assumption. A representative value for 0 2 (p) in the entire sample may 

be obtained by taking a weighted average of thw to estimates from Chart 5, 

namely 10 4 x 0.00166 and 10“ 4 x 0.01580, weighted 11:5, respectively. 

Basis for Assumption. SMEAT's timetable was 

Phase No. of 6-day Periods/Phase 

pre- chamber 4 

in-chamber 9 

post-chamber 3 

The high sample numbers, we suspect, were obtained during the 
post-chamber phase, and because the pre-chamber periods tended to be a 
troublesome time in SMEAT, we attach the larger variance-figure to half 
(arbitrarily) the pre-chamber period as well as to the post-chamber period 
making the appropriate weights on the smaller and larger variance 


and — , 

16 


where t = 3+^(4) = 5. 


Thus, we take as our estimate of 0 (p) for SMEAT the value 

a 2 ( ) _ (ID (0.00166) + (5) (0.01580) x 10 -*> 

16 

= 0.006079 x 10~ 4 



Parameter. E(f). Here we had the problem that we were not 

provided with the correspondence in labeling between the two tables, 
daily means and p-% duplicates, and also were not given any values at all 
for/’ f. We do not viewthis as a serious drawback, however, because we 
are doing a representative calculation here and what is therefore important 
are representative values which we obtain by averaging or "smoothing." 
Because of the irregular and discrete quality of fecal sampling, some 
smoothing would almost inevitably be desirable in any statistical 
evaluation of these results even in the presence of full data on the 
variable, f. Thus, we regard the following averaging to yield a value 
for E(f) as quite satisfactory and trustworthy for this evaluation. 

Although we are not given values on f itself, we are given 
values on a fecal variable besides N f and p, namely 

K = amount N per fecal determination (g) 

Our plan is to obtain from the daily means table a suitable average 

A 

value of K, say K, and then take 

A 

(11) E(f) - 

E (p) 

A 

Multiplying both sides by E(p), this simply says 


(7) f representative value of 
I expected size of fecal 
\ sample 


of fecal 
that is 


sample] 

H / 


i 


\ 


average amount N 
i in a fecal sample J 


To obtain K, we compute initially three separate values for the three 
experimental phases because evacuation habits can be presumed to differ 
in the phases. If is the value of K on the ith fecal determination. 


24.470 

15 


1.6313 



K 

pre 


E 

i=l 

15 


K i = 



„„ 48.528 

ihU = 35 

35 


_ 60 

K post “ j= 5 ^i 
10 


22.960 

10 


1.3865 


2.2960 


thus showing that stratifying by phases was indeed called for. We take K 
as the weighted average of these three phase-specific estimates, with 
weights proportional to the fraction of SMEAT in each phase: 



4 K 


pre 


+ 9 K. 
in 


16 


+ 3 K 
post 


yielding 

K = 1.6182 g 

whereby from (11) we find 

A 1 ft? 

(12a) E(f) - 4V 2 04058 x-16- . - 38 - 4914 

and 

(12b) E 2 (f ) = 1481.59 

Because we want to do the calculation for a one-day period and 
because feces appear irregularly, we need a value for mean g N from feces 

A 

per day.* This may be obtained by smoothing K to find this number as if 
there were a fecal sample of the same size every day fo SMEAT. We have 

A 

E (Nf ) equal to 

mean g N _ mean g N # number of determinations 

day determination number of days 

_ A 60 
_ K ' (6) (16) 

because there were 60 fecal samples in SMEAT (i goes up to 60 in computing 
the K’s). There were 16 6-day periods in SMEAT. 
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( 60 2 
V 96 ; 


Model Calculations . Computation yields 

E (N f ) = 1.0114 g/day 

By (5) , we have 

-j f 10 4 X O 2 (Nf ) = {0.006079) O 2 {f ) + (0. 006079) (1481. 59) 

+ (17.674104) a 2 (f ) 
or 

(9) 0 2 (Nf) = [(17.68017) a 2 (f ) + 3.51797] x 10" 4 . 

Assumption : 

(8) CT 2 (f) = 0.01 E (f ) . 

Basis for Assumption: 

(1) If, as might seem reasonable, f is measured to within ± 2% 

and we take this to be "20’," then equation (8) holds. If the figure 2% 

were not itself so arbitrary, we should multiply the 0(f) m (8) by 
60 

(TTTTT) to obtain an "equivalent daily" standard deviation, but nothing 
b lib; 

crucial hinges on this admustment as we 

(2) shall see. retrospectively : the fecal measurments are the most 

accurate part of tie experiment and the exact number here will influence 
the final answer only negligibly. Equation (8) implies that 


O z (f) = 10“ 4 E 2 (f ) = [0.14816 (r70 z ] = 0.05788 

96 

so the first term in brackets in (9) becomes with this assumption 


60. 


(17.68017) (0.05788) = 1.02333 

which is considerably smaller (although not negligible) with respect to 
3.51797 even still in this "intra-fecal" analysis. 

We find with this assumption 


0 2 (N f ) = 4.5413 x 10- 4 g 2 
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Output — Urine 

Model Variables. We define: 


N u = 9 N from urine per day 


• c = concentration of N in urine (mg/100 cc) 

V = daily urine volume (ml) 

Relations Among Parameters . Again, (*) expresses the idea relating 
the variables: 

N u = (10“ 5 c) • V 

The factor comes by remembering that 1 ml = 1 cc 
From (i) and (ii) , 

E(N U ) = E (c) • E(V) • 10" 5 

a 2 (N u ) = {a 2 (c) a 2 (v) + a 2 (c) e 2 (v) + e 2 (c) a 2 (v)} x lo “ 10 
Estimating Parameters . 

Parameter: E(c). The c-duplicates table gives duplicate 

determinations and their means. Examining the data shows no strong trend 
over time, so we may be satisfied with averaging many c-values spread 

A 

over the whole of the experiment to obtain E(c). Doing this yields 
E (c) = 549.06 


mg 


100 cc 

Parameter: a 2 (c) . We compute the quantity 

(13) [(larger value) - (mean of pair)] 2 

for 20 pairs of Crippen's in-chamber c-values. In-chamber was chosen 
because, looking at the experiment as a whole, this appears to have been 
the most stable phase. Thus, the figure we obtain may be slightly on 
the small side. We obtained these 20 values of (13) (A number in parentheses 
counts how many times the number preceding occurred.) Ordered by size, thay 
are: 1(3); 10, 25, 36, 49, 64, (2); 81, 100, 144 (2); 225 (2); 256, 361, 

784 (2)’; 1024. Their average is 

° 2(c > = 218 - 9 <K )2 
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Parameter: E(V). V7e obtain a value for E(V) in the same way 

A 

as for E(c) — by averaging in-chamber V values. 

We obtain 

E (V) ~ 2499 ml 

Parameter: a 2 (V) 

Assumption : ^ 

a 2 (V) = 10 ml 2 

Basis: We are not given any direct information about the 

measurement accuracy of V, but we can surmise from examining (a) the 
significant figures in their sample values and (b) the units-digits in 
these numbers that they have been measured to within about ± 5-7 ml. 
Taking the midpoint of this interval, 6, to be ''2a," then "a" = 3 and 
"a 2 " = 9, which we found off to 10. [See section on Findings below.] 
Model Calculations . Thus, we may compute 

E(N U ) = (549.06) (2499) 10" 5 = 13.721 g 


and 


term 


£ 2 (n u ) = {a 2 (c) a 2 (v) + a 2 (c) e 2 (v) + e 2 (c) a 2 (v) } io -10 

= {(218.9) (10) + (218.9) (2499) 2 + (549 . 06) 2 (10) } 10“ 10 
= (0.002189 + 1367.8 + 3.0147} * 10 -4 
= 1370.8 x 10“ 4 or O 2 (N u ) = 0.1371 g 2 
Findings . Essentially the entire variance in urine N comes from the 

An 

a 2 (c) e 2 (v) 


Thus, still within this "intra-urine" analysis, we. may conclude that the 
exact value chosen for o (v) will influence negligibly the total 
uncertainty in urine output. We claimed this previously. 

Implication . So far as establishing measurement error goes, comparatively 
little attention in the experimental procedure need be paid to especially 
accurate measurement of V and essentially all attention should be focused 
on making O 2 (c) as small as possible. 


Net Intake 


Model Variables . Here we combine the "intra" analyses to find the 
mean and measurement variance of the net intake, say NET. Estimating this 
quantity is the goal of the MBE. We have 

NET = P - N f - N u , so 

Relations Among Parameters . 

(15) E (NET) = E(P) - E(N f ) - E (N u ) 

(16) a 2 (NET) = a 2 (P) + a 2 (N f ) + a 2 (n u ) 

Estimating Parameters and Model Calculations. Thus, using our "intra" 
results 

(17) E (NET) = 16.056 - 1.0114 - 13.721 = 1.3236\g 

(18) a 2 (NET) = 0.046178 + 0.00045413 + 0.13708 

0 2 (NET) = 0.18371 0 (NET) = 0.42862 

(19) E (NET) ± 20 (NET) = + 1.3236 ± 0.8572 

is thus a one-day MBE confidence interval associated with measurement error 
only. Its scope of inference is the restricted one associated with 
"measurement error only" as described in the text under the sections headed 
"The Restricted Inference" and "Status of the Arnold Document [1] . " The 
confidence interval given above in (19) is not quite overlapping zero, but 
close enough that confidence even in the direction of net change (i.e., 
did he gain or lose the substance?) must be very weak. Confidence in the 
numerical result itself is totally unwarranted. 2a (NET) is 65% of E (NET) . 

Now notice from where this uncertainty comes. Chart 6 shows the 
percentage of the total variance contributed by the variances from each 
phase of the experiment. 
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' Chart 6 
PHASE 

Input Feces ' Urine 

25.0 0.6 74.3 

• ? * 

Art An 

Now recalling the calculation of 0 (N u ) , we found that 0 (N u ) is due 
almost entirely to the term involving cr (c) . Our conclusion, therefore, 
is that the measurement of substance concentration in urine is contributing 
three-quarters of the measurement error. . 

Magnitudes of the uncertainty in the fecal, input and urine phases 
of the experiment go like 

s ! 1 • 3 

The calculation of E (NET) shows that far more nitrogen is coming 
from the urine than from the feces. Although in principle not relevant 
to the variance calculation because the model assumes homogeneity of all 
variances (i.e. , independent of the size of the variable being measured), 
this is for mathematical simplicity, and our assuming it does not make it 

true; in fact such dependence is quite common. 

In the presence of any such dependence, the fact that mosu ou^ut 

nitrogen comes in the urine further underscores the importance of reducing 
a 2 (c) in any MBE where restricted inferences are important, i.e., in any 
experiment whose particular outcome is important (again as distinguished 
from inferences with the wider scope as described in the text) . 

Confidence in the Findings . 

1) Finding (2) gives quantitative support to the intuition of 
experts in the field of biomedical experiments that this measurement is an 
uncertain one. The present calculation shows that this one measurement 
produces a final uncertainty that is ,2a order-of-magnitude greater than 
that generated by the entire input phase. 


J 
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2) Our approximations have been sensible and conservative, and 
have been shown even "intra "-phase not to influence the answer much in 
any case; this is highly strengthened when we view how negligible their 
effects are on the net answers. 

. i 

3) Our parameter estimates were computed on the basis of the 
SMEAT data themselves and are thus "tailored to this particular experiment." 
This view, that our "hatted" quantities are refinements of estimates of 
these quantitit.es appearing in laboratory manuals, or otherwise able to 

be anticipated approximately before the experiment, is substantiated by 

Chart 7, which exhibits the consistency in orders-of-magnitude of our 

estimates with those in Arnold, these latter representing the best available 

estimates of these parameters before SMEAT. 

Chart 7. Output Parameter Estimates Calculated by this Paper from 
Pre-Skylab Data, compared with Arnold's values 


Parameter 

SMEAT Value, g 2 

(Arnold) 

Our "Hatted" Estimate 

a 2 (p) 

8.19925 x 1CT 

7 (p. 48) 

6.079 x io~ 7 

a 2 (f ) 

2.5 x icr 5 

(p. 48) 

14816 x 10“ 5 

a 2 (c) 

0.7050 X 10“ 8 

(p. 49) 

2.189 x io~ 8 

E(f) 

20.74 

(P* 57) 

38.49, 

E(c) 

9.00 mg/ml 

(p. 58) 

5 . 49 mg/ml 

E (V) 

2311 ml 

(p. 58) 

2499.7 ml 

a 2 (v) 

Not given 


10 

E (p) 

0.050 g/g 

(P* 57) 

0.04204. g/g 


Discussion of Chart 7 . (a) The two parameters with sharply discrepant 

values are a 2 (f ) and a 2 (c) . We arrived at a value of o 2 (f) very arbitrarily 
and Arnold is therefore telling us that we were highly overly pessimistic. 
But because the fecal phase even with the pessimistic assumption is the most 
accurate in the experiment, we do not dwell upon this discrepancy. 

(b-i) The other discrepancy involves a 2 (c) , the most important parameter 
in the calculation. We would be more suspicious of our computed value of 
0 (c) as being somehow unrealistically large, were it not for the fact 


that we explicitly acknowledged a possible bias in favor of the well- 
controlled period of SMEAT, the in-chamber period. 

Thus, we conclude that our finding shows that the c-measurements not 
to.^ave been as accurate as it was believed prior to SMEAT that they could 
be. (b-ii) The character of our findings remain unchanged with the 

_ A 

Arnold a 1 2 (c) value instead of our a 2 (c) , although now the terms in 
which one would describe the finding become a little more moderate. 

Overview of Nitrogen Calculations 

The present remarks give a quantitative measure to the relative 
attention that should be paid in the context of measurement error and 
restricted inference (see text) to different phases of an MBE, and they 
confirm the qualitative feeling of practitioners. They also answer a 
specific question about nitrogen intake. 

The Corresponding Calculations for Calcium 

Unlike nitrogen, the primary excretory source of the substance 
Ca is the feces. Because the urine mode made the largest contribution 
to total measurement variance for N, it is of interest to see to what 
extent this N result is sustained in this different Ca situation. 

We begin with Table SAME which exhibits the parameters that are 
"substance-dependent" and therefore whose values remain the same in principle 
for the Ca calculation as for the N calculation. 


Table SAME 

Model Parameter Estimates with the Same Value in N and Ca Calculations 
Parameter Estimate 


E(Ci) 
CT 2 (Ci) 
E(Ri) 


0 2 (%) 
E(V) 


1 X IQ- 4 

As given for each can type 
on p. (Progress Report) 
Same as above 
2499.7 ml 


a 2 (v) 


10 ml 2 


E (f ) , a (f) (approximately, see text) 
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The following sections summarize our computations of the remaining 

parameter-estimates in an order paralleling that of the N calculation. 

A 

Table P Ca exhibits for Ca the contribution from each food to E(P) 
and-'to each term in the- sum (2) representing cr (P) . Again a small fraction 

A 

of the foods contribute to E (P) , and these same foods also contribute 

, 

most of a (F) . Thus, the implication that the number of measurements 
taken in the MBE could have been reduced substantially applies for Ca 
as well as for N . 

VN 

Parameter: E(P Ca ). To obtain E(p) for Ca, we used the p-% 

duplicates table for N as follows. The average of 

q Ca 

ihr = r ' say ' 

over many samples spread over all phases is: 

r = ,0.4080597. 

A A 

Multiplying this by E(p) for N yields our estimate of E(p) for Ca: 

B(PCa) = (0.4080597) (4.204058) = 1.715507 


2 

Parameter: O (p c a) . For 0 (p Ca ) > we used 



(PCadata^ 


a 2 (7 * P N ) 

r 2 a 2 (p N ) 

(0.4070597) 2 ; (6.079 * 10~ 7 ) 
0.10122308 x 10“ 6 


Because of the somewhat ad hoc character of the above choices, we 

A n 

shall also "carry along" the value of 0 (Pca) offered in Arnold: 

^ (p Ca A rnold ) = 0.000321 x i 0 - 6 . 

Notice that they are indeed discrepant. At the end we see how big 
a difference this makes in the NET variance. 


w 


Table 

^Ca 

V»>- 

Food No. 

E(P) 

10 2 * [a 2 (A)+E 2 (A) I a 2 {c) 

10 2 X [a 2 (A)+E 2 (A)] a 2 (R) 

10 2 X a 2 (A) tl-E(R)] 

38 

22.4954 

5.1907 

0.8155 

509.19 

16 

146.2091 

219.1243 

304.8019 

2122.33 

13 

10.2032 

1.0799 

1.5021 

'*•. 135.53 

8 

69.3433 

50.5550 

89.8362 

2005.23 

8 

69.3433 

• 50.5550 

89.8362 

2005.23 

62 

4.2601 

0.1906 

0.3387 

5.92 

62 

4.2601 

0.1906 

0.3387 

5,92 

71 

23.8907 

5.8496 

8.1368 

45.31 

65 

3.6257 

0.1370 

0.1906 

19.33 

47 

10.7139 

1.1713 

0.1840 

52.58 

19 

29.0526 

8.5737 

1.3469 

16.44 

42 

3.0062 

0.0957 

0.1701 

6.30 

42 

3.0062 

0.0957 

0.1701 

6.30 

62 

4.2601 

0.1906 

0.3387 

5.92 

39 

271.3494 

748.0230 

117.5144 

2432.31 

29 

69.9626 

50.1545 

69.7649 

297.34 

69 

30.3160 

9.4269 

13.1128 

126.26 

8 

69.3433 

50.5550 

89.8362 

2005.23 

66 

0-6245 

0.0040 

0.0071 

0.29 

66 

0.6245 

0.0040 

0.0071 

0.29 

62 

4.2601 

0.1906 

0.3387 

5.92 

62 

4.2601 

0.1906 

0.3387 

5.92 

62 

4.2601 

0.1906 

0.3387 

5.92 

4 

10.6751 

1.2060 

6.9827 

165.26 

42 

3.0062 

0.0957 

0.1701 

6.30 

42 

3.0062 • 

0.0957 

0.1701 

6.30 

60 

81.8762 

70.4723 

125.2293 

2715.35 

E(P) = 

957.2342 mg 

1273.6080 

921.8173 

14714.22 


So 

10 2 x a 2 (p) = 16909.65 mg 2 
a 2 < P) = 169.0965 mg 2 


C(P) = 13.003711 mg 
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Parameter: E(f). Although E(f) should not in principle change 

(see Table SAME) , because of some mi.ssing values for N which were present 

t * A 

in the Ca fecal table for CDR, we may recompute E(f) with a few additional 
samples to find that its'value does change slightly from (12a); by (11) we 
have 

712.405 x 10" 3 g 
E(1> " 17715507 V lO-? 

= 41.527 g 

A 

Parameter: K. Again, the few missing N values led to a slightly 

A 

larger sample for the K calculation for Ca: parallel to that for N, we find 

Table K. 

Table K 


Phase No. 

of Observations 

K (by phase) in 

Pre-chamber 

16 

623.35 

In-chamber 

38 

710.28 

Post- chamber 

11 

837.52 


From Table K, we compute 

_ 4(623.35) + 9(710.28) + 3(837.52) 


K = 


16 


- 712.405 mg 


A A 

parameter: E(Ca f ). Using K, we find for E(Caf) corresponding 

A 

to the calculation for E(N f ). 


we find 


/v 65 

E(Ca f ) = (712.405) (~— lfi) ) = 482.358 mg 

Parameter: O 2 (f ) . Assuming as we did for N that 0(f) = .01 E(f), 
O 2 (f data ) = 10 ~ 4 X E2(f) = 10-4 x (41.527) 2 = .17245 g 2 


We may compare this value with the Arnold value 

a 2 (f ,J 51 2.5 * io~V 

Arnold ^ 

A p 

to find that, as for N, the former o (f) is again probably an overly 

data 

pessimistic estimate of 0 2 (f). 


o 

Parameter: o'~ (Ca^) . Combining the foregoing parameter estimates 


according to 


a 2 (ca f ) = a 2 (p) S* (f) + a 2 {p) e 2 (f ) + e 2 (p) a 2 (f) 
allows us tu write, carrying along both the "data" and the Arnold estimates 


for a 2 (p) and (f ) 


f". 000321 X IQ" 6 *V 2.5 x 10“ 5 


"L 10122308 x iq-^ 17245 x io 


_ + (41.527) 




(.01715507) 


( 2.5 X io- 5 ] 
17245 x 10“ 5 / 


Taking the smaller of each pair to produce a lower bound and the 
larger to yield an upper bound, we find a pair of estimates for O z (Ca^r) as 
follows : 


.0008025 x 10 
1745.592 x io 


-ll 
_li + 


.55356185 x 10 
174.55836 x io 


-6 

-6 


.00073575 x 10* 
5.0752035 x 10* 


.0000000008025 x 10“ 5 + .055356185 x 10“ 5 + .00073575 x IQ -5 


.001745592 


x 10 


_5 


17.455836 x IQ -5 + 5.0752035 X IQ" 5 


0.5609194 x io" 6 g 2 T - .0074895 g = .74895 mg 

oor 327Q X in -6 a 2 r— 

y / = .01501093 g = 15.01093 mg 


Notice that the second term is the primary contributor to the sum in both 
cases. We return to this point at the end. 

Parameter: E(Ca u ). The table of daily means, ~CDR SMEAT URINE, 

gives this parameter in units of milliequivalents (meq) ; multiplying by 

28 2S— , we find as the mean over the entire MBE 
meq 

E(Ca u ) = 246.58 mg. 

Parameter : O z (Ca u ) . We unfortunately do not have available the 

Ca analog to the c-duplicates table for N, whence we cannot "build up" an 
estimate of a 2 (Ca u ) from that for a 2 (c) and thence partition the total 

« An 

a (Ca u ) . Instead we seem obliged to infer a value for O ; (Ca u ) from the 
significant figures in the Ca column of the CDR SMEAT URINE table. As 
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mentioned above, this table column is in meq; the significant figures 
suggest a measurement to within an accuracy of ±0.5 meq. 

If we estimate 0 2 {Ca u ) = 0.5/2 meq 
, = 0.25 (28) == 7.0 mg 


we find ^ 

02(Ca u)data = 49 *° m 9 2 

The Arnold report does, however, give a value 

a 2 (c Ca ) = 14.329859 X 10 -6 (22) 2 

in order to use this value to compute a a 2 (Ca u ) for comparison to its 

A 

data based value. We lack only a value for E(c Ca ) which we obtain as 

the means of CDR's daily values of 

daily Ca u (meq) * 28 
daily urine volume 


This turns out to be 

A 

E(c Ca ) - 0.104585 mg/ml. 


Now using 


<Ca u ) = a 2 (c Ca ) a z (v Ca ) +a z (c Ca ) e z (v Ca ) + e z ( c Ca ) a" (v Ca ) 


yields 

O 2 (Ca u ) . = (14.3298 X 10" 6 (10) + (14.329859 x 10~ 6 ) (2499. 7) 2 

u Arnola 

+ (.104585) 2 (10) 

= .000143298 + 89.53975 + .10938 = 89.649 mg 2 


which primarily reflects Arnold's estimate and is therefore an estimate 
almost independent of S 2 (Ca u ) data . We see that they are quite close, 
confirming our significant figures approach. 

Parameters: E(NET Ca ), O 2 (NET Ca ) . We can now compute the mean 


and variance of the net retention of Ca: 
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I- 


E(NET Ca ) « E(P Ca ) - E (Ca f ) - E(Ca u ) = .9572342 g - .482358 g - .24653 g 
= + .22829 g 


1 i 


% 

■ ■ n 


\ _ r\r\r? rr . J-.5609194 r89.649., v 1 n — 6 „2 

225.3279 49. 


0 z (NET Ca ) = [169.0965 + { 


] x 10" D g‘ 


Again, associating the smaller of both pairs to yield a lower bound 
and the larger to yield an upper bound, we find 

S 2 (HET Ca ) - x 10 ' 6 9 *' 

whence the one-day MBE confidence interval in net retention , due to measure- 
ment errpr only, is 


E(MET Ca ) a(NET Ca ) = .22829 ± .375 g, 


where 0 (NET Ca ) = /218.657 + 484.0714 X IQ" 3 


Here 20 is 16% of E. Thus, the measurements in a Ca-MBE are more 
accurate than those in an N-MBE, where this figure was 65%. The text 
showed that the corresponding result holds for the biological variability 
as well. Note that these two are independent results. 

The following tables show the percent contribution to the total variance 
for the data based and the Arnold based estimates comprising O (NETc a ) . 

Arnold Estimates 
Input Feces Urine 

65.2 0.22 34.6 


Input 

38.1 


Authors 1 Estimates 


Feces 


50.8 


Urine 


11.1 


J 
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Those tables show that the discrepancy between our data-based estimate 

and the corresponding estimate by Arnold is indeed reflected strongly 
in what fraction of the total variance is due to the fecal measurements. 

' But, the sensitivity of c f 2 (NET Ca ) to 0 Z (P Ca ) nevertheless suggests 
that the p-measurement has an importance for Ca-MBE that it does not for 
an N-MBE, i.e., that the fact of the feces being the primary excretory 
channel for Ca reflects itself in an increased importance to the fecal 
parameter p, not the fecal parameter f. 

The dietary measurement is now a primary contributor to 0 (NET Ca ) , 
unlike iri the N-MBE, suggesting that attention to input measurement 
accuracy is more called for in a Ca-MBE than in an N-MBE, and our remarks 

• '*'■■■ . A o 

as to the relative magnitudes of the several contributors to cr (P Ca ) 
accordingly deserve more serious consideration ir„ connection with Ca-MBE 
design than with N-MBE design. 

Chart 8 summarized the estimates obtained by the N data-based 
calculation and by the Ca Arnold-based and data-based calculations. 

We would emphasize, of course, the relatively small contribution of 
measurement error to the total variance in metabolic balance experiments. 

i 



Chart 8. Model-Parameter Estimates that Differ for N and Ca 


Phase 

Parameter 

Unit 

N Data 

Ca (Arnold) 

Ca (Data) 

Input 

E(p) 

g 

16.056 


0.9572342 

Input 

a 2 (p) 

9 Z 

46178 x ICT 6 

— 

1690965 x 10“ 6 

Input 

C.V. 

% 

1.35 

' 

1.36 

Feces 

E(p) 

1 

0.04204058 

— 

0.01715507 

Feces 

a 2 (p) 

1 

0.6709 x ICT 6 

0.000321 x io" 6 

0.10122308 x 10 -6 

Feces 

c.v. 

% 

1.85 

0.10 

1.85 

Feces 

K 

g 

1.6182 

— 

0.712405 

Feces 

E (f ) 

g 0 

38.4914 

— 

41.527 

Feces 

a 2 (f ) 

g 2 

148160 x io 6 

25 x io" 6 

172450 x io~ 6 

Feces 

c.v. 

% 

1.00 

0.01 

1.00 

Feces 

E (* f ) 

g. 

1.0144 

— 

0.482353 

Feces 

a 2 c f ) 

g 

1162.609 x IQ" 6 

0.5609194 x io -6 

225.3279 x 10“ 6 

Feces 

c.v. 

% 

3.37 

0.15 

3.11 

Urine 

E(C) 

g/i 

5.4096 

— 

— 

Urine 

a 2 (c) 

(g/D 2 

21890 x 10" 6 

14.329859 x 10“ 6 

— 

Urine 

c.v. 

% 

2.69 

3.17 

— 

Urine 

E(-u) 

g„ 

13.721 

— 

0.24658 

Urine 

a (*u) 

g 

137100 x io" 6 

89.649 x HP 6 

49 x io- 6 

Urine 

c.v. 

% 

2.70 

3.8 

2.83 


Note ; The C.V. 's in the "Ca (Arnold) "column used the corresponding E(*) from the "Ca(Data)" column 
as its denominator. 





